Explains the full Transformer architecture and the self-attention mechanism (a minimal sketch of self-attention follows these summaries), highlighting the paradigm shift toward fully pretrained models.
Explores the Transformer model, tracing the move from recurrent models to attention-based NLP, and highlights its key components and notable results in machine translation and document generation.
Covers the foundational concepts of deep learning and the Transformer architecture, focusing on neural networks, attention mechanisms, and their applications in sequence modeling tasks.
Delves into the training and applications of Vision-Language-Action models, emphasizing the role of large language models in robotic control and the transfer of web-scale knowledge, and highlights experimental results and future research directions.
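As context for the self-attention mechanism referenced in the first summary, here is a minimal NumPy sketch of scaled dot-product attention, following the formula softmax(QK^T / sqrt(d_k))V from "Attention Is All You Need"; the toy dimensions, random weights, and function name are illustrative assumptions, not drawn from the covered material.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # pairwise similarity of queries and keys
    # Numerically stable softmax over each query's scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ V  # attention-weighted mixture of the values

# Self-attention: queries, keys, and values are projections of the same sequence.
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))  # 5 tokens, model dimension 8 (toy sizes)
W_q, W_k, W_v = (rng.normal(size=(8, 8)) for _ in range(3))
out = scaled_dot_product_attention(x @ W_q, x @ W_k, x @ W_v)
print(out.shape)  # (5, 8): one contextualized vector per token
```

The 1/sqrt(d_k) scaling keeps the dot products from growing with the key dimension, which would otherwise push the softmax into near one-hot saturation.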