Lecture

Handling Text Data: Document Retrieval and Classification

Description

This lecture covers the handling of text data, focusing on document retrieval and classification. Topics include typical tasks such as sentiment analysis and topic detection, the use of TF-IDF matrices, and the challenges that sparsity poses for text data. The instructor introduces the bag-of-words representation and discusses the application of matrix factorization techniques. The lecture also covers contextualized word vectors, such as those produced by BERT, and explains why they matter in modern NLP models for more advanced natural language processing tasks. The NLP pipeline, from tokenization to coreference resolution, is explained as well.
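As an illustration of the bag-of-words and TF-IDF ideas mentioned above, here is a minimal sketch in plain Python. It is not taken from the lecture: the whitespace tokenizer, the weighting tf * log(N / df), the toy documents, and the helper names (`tokenize`, `tfidf_vectors`, `embed_query`, `rank`) are all assumptions chosen for brevity. It builds sparse TF-IDF vectors, measures how sparse the resulting document-term matrix is, and ranks documents against a query by cosine similarity.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive lowercase whitespace tokenizer (an assumption; real pipelines do more).
    return text.lower().split()

def tfidf_vectors(docs):
    """Return (idf, vectors), where each vector is a sparse dict term -> weight.

    Assumed weighting: raw term frequency times log(N / document frequency).
    """
    bags = [Counter(tokenize(d)) for d in docs]  # bag-of-words counts per document
    n_docs = len(docs)
    df = Counter()
    for bag in bags:
        df.update(bag.keys())  # count each term once per document
    idf = {term: math.log(n_docs / count) for term, count in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in bag.items() if idf[t] > 0} for bag in bags]
    return idf, vectors

def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Toy corpus (hypothetical data, for illustration only).
docs = [
    "the movie was great",
    "the movie was terrible",
    "stock markets fell sharply",
]
idf, vectors = tfidf_vectors(docs)

def embed_query(query):
    # Terms never seen in the corpus are ignored.
    bag = Counter(tokenize(query))
    return {t: tf * idf[t] for t, tf in bag.items() if t in idf and idf[t] > 0}

def rank(query):
    # Document indices ordered from most to least similar to the query.
    q = embed_query(query)
    scores = [cosine(q, v) for v in vectors]
    return sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)

# Fraction of zero entries in the (documents x vocabulary) matrix.
vocab_size = len(idf)
nonzeros = sum(len(v) for v in vectors)
sparsity = 1 - nonzeros / (len(docs) * vocab_size)

print(rank("great movie"))
print(f"sparsity: {sparsity:.2f}")
```

Even in this three-document corpus more than half of the matrix entries are zero, and the fraction grows quickly as the vocabulary expands. That is one reason sparse storage and dimensionality reduction, for example through matrix factorization, are commonly applied to such matrices.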
