Lecture

Words Tokens: Lexical Level Overview

Description

This lecture delves into the fundamental concepts of words, tokens, and language models in Natural Language Processing (NLP). It starts by discussing the challenges in defining words and tokens, emphasizing the importance of context. The lecture explores the distinction between words and tokens, the role of lexicons in NLP systems, and the use of n-grams for language modeling. It covers the implementation of lexica, access methods, and the significance of surface forms. Additionally, it explains the estimation of probabilities in language models, including additive smoothing techniques. The lecture concludes by highlighting the key points related to lexica usage, tokenization challenges, the effectiveness of n-grams, and smoothing methods.

This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.

Watch on Mediaspace
About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.