This lecture introduces the field of data-intensive applications and systems, highlighting the exponential growth of data from 2010 to 2020 and the challenges it poses to processing technology. It covers the evolution of hardware to handle data efficiently, the importance of taming data variety, and the significance of data cleaning for veracity. Additionally, it explores approximate query processing, multi-query analytics, and hybrid transaction and analytical processing. The concept of HetExchange for portability across devices is discussed, emphasizing the use of heterogeneous hardware. The lecture concludes with insights on real-time intelligence, just-in-time query engines, and hardware-aware data management, aiming to build lean and agile data systems.
This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.
Watch on Mediaspace