Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture by the instructor focuses on machine visual perception, covering topics such as difficulties in machine visual perception, applications like face detection, learning with noisy labels, and the use of synthetic data for action recognition. The lecture also delves into weakly-supervised training, the Speech2Action model, and the Zero-shot VideoQA approach. The instructor discusses the challenges and benefits of weakly-supervised learning, the impact of temporal extent on 3D convolutions, and the VectorNet model for behavior prediction in cars. The lecture concludes with insights on future research directions towards intelligent systems, including multimodal data analysis and interaction with the world.