Explores the landscape of big data, the importance of memory in online services, the challenges facing memory systems, emerging DRAM technologies, and storage-class memory.
Covers data science tools, Hadoop, Spark, data lake ecosystems, the CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and the MapReduce architecture.