Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers the second assignment for the CS-552: Modern NLP course, focusing on transfer learning and data augmentation. Students will implement training and evaluation of a pre-trained language model (DistilBERT) for natural language inference (NLI) tasks. They will identify model shortcuts, perform word-pair pattern extraction, annotate new data, and explore data augmentation methods. The lecture provides detailed instructions and code snippets for each part of the assignment, including model finetuning, identifying model shortcuts, and data augmentation.