Lecture

Words Tokens: Lexical Level Overview

Description

This lecture delves into the fundamental concepts of words, tokens, and language models in Natural Language Processing (NLP). It starts by discussing the challenges in defining words and tokens, emphasizing the importance of context. The lecture explores the distinction between words and tokens, the role of lexicons in NLP systems, and the use of n-grams for language modeling. It covers the implementation of lexica, access methods, and the significance of surface forms. Additionally, it explains the estimation of probabilities in language models, including additive smoothing techniques. The lecture concludes by highlighting the key points related to lexica usage, tokenization challenges, the effectiveness of n-grams, and smoothing methods.

This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.

Watch on Mediaspace

Official source

https://mediaspace.epfl.ch/media/0_4qbl105k

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Ontological neighbourhood

Statistics

Statistical inference: Mathematical statistics, Bayesian statistics

Information engineering

Natural language processing: Topics in natural language processing

Linguistics

Theoretical linguistics: Syntax

Related lectures (61)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.