This lecture discusses a massive-scale data processing system at EPFL, focusing on query engines such as Hive and Spark for ad hoc queries and on reducing processing costs for highly concurrent workloads. It covers shared-work systems, shared query executors, batch query optimizers, and adaptive data placement strategies.
This video is available only on Mediaspace, to a restricted audience. If you have the necessary permissions, log in to Mediaspace to access it.