This lecture covers how Hadoop integrates scalable data storage with MapReduce processing. Topics include HDFS, Hive, Parquet, ORC, Spark, and HBase. It also explains why key design matters in HBase, how a Hive SerDe is used with JSON data, and how HBase is architected.
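
To make the point about key design concrete, here is a minimal sketch, assuming "key design" refers to HBase row keys. HBase stores rows sorted lexicographically by row key, so keys that grow monotonically (for example raw timestamps) tend to send all writes to a single region. A common mitigation is to prefix the key with a salt bucket and to reverse the timestamp so that the newest rows sort first. The function name, the `sensor_id` field, the bucket count, and the key layout are all hypothetical and are not taken from the lecture; the sketch only builds the key bytes and does not talk to an HBase cluster.

```python
import hashlib

# Hypothetical settings chosen for illustration only.
NUM_BUCKETS = 8               # number of salt buckets to spread writes over
MAX_TS = 9_999_999_999_999    # largest 13-digit millisecond timestamp


def salted_row_key(sensor_id: str, ts_millis: int) -> bytes:
    """Build a row key of the form b'<bucket>|<sensor_id>|<reversed_ts>'."""
    # Derive the salt from the sensor id, so all rows of one sensor
    # land in the same bucket and can still be read with one prefix scan.
    digest = hashlib.md5(sensor_id.encode("utf-8")).digest()
    bucket = digest[0] % NUM_BUCKETS

    # Reverse the timestamp so newer rows sort before older ones.
    reversed_ts = MAX_TS - ts_millis

    # Zero-pad numeric parts so lexicographic order matches numeric order.
    return f"{bucket:02d}|{sensor_id}|{reversed_ts:013d}".encode("utf-8")


if __name__ == "__main__":
    older = salted_row_key("sensor-42", 1_700_000_000_000)
    newer = salted_row_key("sensor-42", 1_700_000_060_000)
    # The newer reading sorts first within the same sensor prefix.
    print(sorted([older, newer])[0] == newer)
```

The trade-off in this layout is that a scan over all sensors has to visit every bucket prefix, whereas a scan for a single sensor stays within one contiguous key range.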