This lecture covers data sampling as a paradigm shift for fitting data in memory, reducing spilling to disk, and cutting processing time in Volcano-style and JIT-compiled query engines. It discusses strategies for reducing sampling overheads, the use of shared operators, and datapath search strategies for efficient query processing. The instructor presents the experimental setup, benchmark results, and future plans for optimizing query execution.