Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture by the instructor from Stanford University on efficient machine learning via data summarization covers the challenges of training on large data, identifying the 'right' data, and developing methods for data summarization. The lecture presents the research applications and impact of data summarization methods, including image summarization, clustering, sensor placement, and revenue maximization. It introduces the Greedy algorithm for submodular maximization and discusses the limitations of the Greedy approach. The lecture then delves into the development of the GreeDi algorithm for distributed data summarization and the CRAIG approach for learning from summaries, showcasing applications to logistic regression, neural networks, and deep networks.