Lecture

Integrating Scalable Data Storage and Map Reduce Processing with Hadoop

Description

This lecture covers the integration of scalable data storage and map reduce processing using Hadoop. Topics include HDFS, Hive, Parquet, ORC, Spark, and HBase. It explains the importance of key design in HBase, Hive SerDe with JSON, and HBase architecture.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.