Covers the exponential growth of data, challenges in processing technology, data variety, cleaning, approximate query processing, multi-query analytics, and hybrid transaction processing.
Covers data stream processing with Apache Kafka and Spark, including event time vs processing time, stream processing operations, and stream-stream joins.