Lecture

Deep Visual Recognition: Interpretability

Description

This lecture by the instructor focuses on deep visual recognition and interpretability. It covers the deep learning revolution, semantic segmentation, and deformable 3D reconstruction. The lecture explores the use of Bag of Words and Bag of Visual Words models, standard CNN architectures, and the creation of visual dictionaries. It delves into experiments with datasets, visual codewords, and adversarial attack detection. The instructor also discusses dealing with complex scenes, proposing solutions for encoding local features and generating attention maps. The lecture concludes with insights on scene recognition, attention-aware pooling, and the importance of interpretability in deep networks.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.