Lecture

Automatic Understanding of the Visual World

Description

This lecture by the instructor focuses on machine visual perception, covering topics such as difficulties in machine visual perception, applications like face detection, learning with noisy labels, and the use of synthetic data for action recognition. The lecture also delves into weakly-supervised training, the Speech2Action model, and the Zero-shot VideoQA approach. The instructor discusses the challenges and benefits of weakly-supervised learning, the impact of temporal extent on 3D convolutions, and the VectorNet model for behavior prediction in cars. The lecture concludes with insights on future research directions towards intelligent systems, including multimodal data analysis and interaction with the world.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.