Lecture

Neural Networks: Transformers

Description

This lecture covers the concept of transformers in neural networks, focusing on sequence-to-sequence transformations. It explains how various data types like words, images, and multimodal data can be represented as sequences and processed using transformers. The lecture delves into the architecture of transformers, including self-attention mechanisms, multi-head self-attention, and the importance of positional information. It also discusses the role of transformers in tasks like sentiment classification, translation, and image description. The presentation concludes with insights on the vision transformer architecture and the capabilities of transformers in capturing long-range dependencies.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.