Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers various algorithms and techniques for information extraction, including the Viterbi algorithm, named entities recognition, part-of-speech tags, and word n-grams. It explores the use of hand-written patterns, supervised machine learning, bootstrapping, and distant supervision. The instructor discusses the challenges of semantic drift and the use of knowledge bases for distant supervision. Matrix factorization and Bayesian personalized ranking are introduced as methods for relation extraction. The lecture also delves into linking text to knowledge bases, creating matrix representations, and utilizing relation embeddings.