Provides an overview of Natural Language Processing, focusing on transformers, tokenization, and self-attention mechanisms for language understanding and generation.
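As a companion to this entry, here is a minimal sketch of scaled dot-product self-attention, the core operation inside a transformer layer. The toy dimensions, random weights, and function names are illustrative assumptions, not values from the source material.

```python
# Minimal scaled dot-product self-attention sketch (toy sizes and random weights are assumed).
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, W_q, W_k, W_v):
    """X: (seq_len, d_model); W_q/W_k/W_v: (d_model, d_head)."""
    Q, K, V = X @ W_q, X @ W_k, X @ W_v
    d_head = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_head)      # pairwise token affinities
    weights = softmax(scores, axis=-1)      # each row is a distribution over tokens
    return weights @ V                      # contextualized token vectors

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 16))                # 5 tokens, d_model = 16
W_q, W_k, W_v = (rng.normal(size=(16, 8)) for _ in range(3))
print(self_attention(X, W_q, W_k, W_v).shape)   # (5, 8)
```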
Explores deep learning for NLP, covering word embeddings, contextual representations, and training techniques, along with challenges such as vanishing gradients and broader ethical considerations.
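To make the word-embedding idea concrete, the sketch below trains a tiny skip-gram-style model with a full-softmax objective over an assumed toy corpus; the window size, embedding dimension, and training settings are illustrative choices, not details from the source.

```python
# Skip-gram-style word embedding sketch on a toy corpus (all hyperparameters assumed).
import torch
import torch.nn as nn

corpus = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}

# (center, context) pairs from a window of size 1
pairs = [(idx[corpus[i]], idx[corpus[j]])
         for i in range(len(corpus))
         for j in (i - 1, i + 1) if 0 <= j < len(corpus)]

emb_in = nn.Embedding(len(vocab), 8)    # center-word vectors
emb_out = nn.Embedding(len(vocab), 8)   # context-word vectors
opt = torch.optim.Adam(list(emb_in.parameters()) + list(emb_out.parameters()), lr=0.05)
loss_fn = nn.CrossEntropyLoss()

centers = torch.tensor([c for c, _ in pairs])
contexts = torch.tensor([o for _, o in pairs])
for _ in range(200):
    logits = emb_in(centers) @ emb_out.weight.T   # score every vocab word as context
    loss = loss_fn(logits, contexts)
    opt.zero_grad(); loss.backward(); opt.step()

# Words nearby in the learned space
vec = emb_in.weight.detach()
sims = torch.cosine_similarity(vec[idx["cat"]].unsqueeze(0), vec)
print(sorted(zip(sims.tolist(), vocab), reverse=True)[:3])
```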
Explores coreference resolution models, challenges in scoring spans, graph refinement techniques, state-of-the-art results, and the impact of pretrained Transformers.
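The scoring step mentioned here can be illustrated with a simplified span-pair scorer in the spirit of span-ranking coreference models: each candidate span gets a mention score, and each (span, antecedent) pair gets a compatibility score. The dimensions, random span representations, and network shapes below are assumptions for illustration, not the architecture described in the source.

```python
# Simplified span-pair scoring sketch for coreference (all shapes and features assumed).
import torch
import torch.nn as nn

d = 32                                   # span-representation size (assumed)
mention_scorer = nn.Sequential(nn.Linear(d, 64), nn.ReLU(), nn.Linear(64, 1))
pair_scorer = nn.Sequential(nn.Linear(3 * d, 64), nn.ReLU(), nn.Linear(64, 1))

def coref_scores(spans):
    """spans: (num_spans, d) representations; returns (num_spans, num_spans) scores
    where entry (i, j) scores span j as an antecedent of span i."""
    s_m = mention_scorer(spans).squeeze(-1)              # how mention-like each span is
    n = spans.size(0)
    i = spans.unsqueeze(1).expand(n, n, d)
    j = spans.unsqueeze(0).expand(n, n, d)
    pair_feats = torch.cat([i, j, i * j], dim=-1)        # [span_i; span_j; elementwise product]
    s_a = pair_scorer(pair_feats).squeeze(-1)            # pairwise compatibility
    return s_m.unsqueeze(1) + s_m.unsqueeze(0) + s_a     # s(i, j) = s_m(i) + s_m(j) + s_a(i, j)

spans = torch.randn(6, d)                                # 6 candidate spans (random stand-ins)
print(coref_scores(spans).shape)                         # torch.Size([6, 6])
```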
Explores decoding from neural models in modern NLP, covering encoder-decoder models, decoding algorithms, issues with argmax decoding, and the impact of beam size.
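The contrast between argmax decoding and beam search can be shown with a small sketch. The next-token distribution below is a deterministic stand-in for a real decoder's softmax, and the vocabulary, beam size, and lengths are assumptions for illustration.

```python
# Greedy (argmax) decoding vs. beam search over a toy next-token distribution (assumed).
import math
import numpy as np

VOCAB = ["<eos>", "the", "cat", "sat"]

def next_token_probs(prefix):
    """Stand-in for a trained decoder: a deterministic pseudo-random distribution."""
    seed = hash(tuple(prefix)) % (2**32)
    return np.random.default_rng(seed).dirichlet(np.ones(len(VOCAB)))

def greedy_decode(max_len=5):
    prefix = []
    for _ in range(max_len):
        tok = int(np.argmax(next_token_probs(prefix)))   # argmax at every step
        prefix.append(tok)
        if VOCAB[tok] == "<eos>":
            break
    return [VOCAB[t] for t in prefix]

def beam_search(beam_size=3, max_len=5):
    beams = [([], 0.0)]                                  # (prefix, log-probability)
    for _ in range(max_len):
        candidates = []
        for prefix, score in beams:
            if prefix and VOCAB[prefix[-1]] == "<eos>":
                candidates.append((prefix, score))       # keep finished hypotheses
                continue
            for tok, p in enumerate(next_token_probs(prefix)):
                candidates.append((prefix + [tok], score + math.log(p)))
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_size]
    best = max(beams, key=lambda c: c[1])
    return [VOCAB[t] for t in best[0]]

print("greedy:", greedy_decode())
print("beam  :", beam_search(beam_size=3))
```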
Explores pretraining sequence-to-sequence models with BART and T5, discussing transfer learning, fine-tuning, model architectures, downstream tasks, performance comparisons, and summarization results, with pointers to key references.
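As a usage illustration for this entry, the sketch below applies a pretrained BART checkpoint to summarization, assuming the Hugging Face `transformers` library and the publicly released "facebook/bart-large-cnn" model; the example text and generation settings are illustrative assumptions, not results from the source.

```python
# Hedged sketch: summarization with a pretrained seq2seq model via Hugging Face transformers.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_name = "facebook/bart-large-cnn"   # assumed public checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

article = (
    "Pretrained encoder-decoder models such as BART and T5 are fine-tuned on "
    "labeled summarization data and then generate summaries with beam search."
)
inputs = tokenizer(article, return_tensors="pt", truncation=True)
summary_ids = model.generate(
    **inputs,
    num_beams=4,          # beam size, as discussed in the decoding entry above
    max_length=40,
    early_stopping=True,
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```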