Lecture

Transformer Networks: Self-Attention

Description

This lecture covers Transformer networks and self-attention layers. It explains how a self-attention layer maps a set of inputs to a set of outputs and introduces multi-head attention. It also covers how the weights are learned, why positional encoding is needed, and how interpretable the individual attention heads are.
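As a rough illustration of the set-mapping idea mentioned above, the following is a minimal sketch of single-head scaled dot-product self-attention in plain Python. It is not taken from the lecture: the helper names (`matmul`, `softmax`, `self_attention`, `random_matrix`), the dimensions, and the random weight matrices are all assumptions chosen for the example, and in a trained network the projection weights would be learned rather than sampled. The sketch also checks that reordering the inputs only reorders the outputs, which is one way to see why positional information has to be added separately.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (illustrative only).

    x is n x d_in; w_q, w_k, w_v are d_in x d_k projection matrices.
    Returns the n x d_k outputs and the n x n attention weights.
    """
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(q[0])
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    # scores[i][j] = (q_i . k_j) / sqrt(d_k)
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]
    weights = [softmax(row) for row in scores]  # each row sums to 1
    return matmul(weights, v), weights

def random_matrix(rows, cols, rng):
    """Hypothetical stand-in for learned weights or input embeddings."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
n, d_in, d_k = 4, 3, 2  # assumed sizes for the example
x = random_matrix(n, d_in, rng)
w_q, w_k, w_v = (random_matrix(d_in, d_k, rng) for _ in range(3))

out, weights = self_attention(x, w_q, w_k, w_v)

# Feed the same inputs in a different order: the outputs come back
# in that same order, so the layer itself carries no notion of position.
perm = [2, 0, 3, 1]
out_perm, _ = self_attention([x[i] for i in perm], w_q, w_k, w_v)
```

Multi-head attention would run several such heads in parallel, each with its own projection matrices, and combine their outputs; that step is left out here to keep the sketch short.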

Official source

https://mediaspace.epfl.ch/media/0_td8k8hm4

Ontological neighbourhood

Mathematics

Mathematical logic: Set theory

Information engineering

Machine learning: Artificial neural networks

Linguistics

Theoretical linguistics: Syntax
