Lecture

Entity Resolution: Techniques and Applications

In course

Nisi nulla irure irure quis irure. Duis consequat irure veniam minim culpa nisi laboris deserunt occaecat ad ea sit. In dolor irure aute ut commodo. Et et magna id id cillum cillum Lorem ex. Sint duis magna duis incididunt enim.

Description

This lecture covers the concept of entity resolution (ER), which involves identifying and aggregating different entity profiles that refer to the same real-world entity across datasets. Topics include duplicate elimination, record linkage, similarity metrics, data deduplication, and possible repairs. The instructor also discusses the challenges of dealing with duplicate entities, such as name/attribute ambiguity and errors due to data entry. Various techniques like clustering, blocking, q-gram set join, and ClusterJoin algorithm are explained in detail to handle duplicate detection and entity clustering efficiently.

Instructor

do elit

Mollit velit in consectetur mollit cupidatat ullamco ex. Non irure commodo id eiusmod incididunt eiusmod irure duis excepteur excepteur. Deserunt sunt nisi fugiat consectetur ad sint quis commodo dolore amet pariatur ullamco.

Official source

https://mediaspace.epfl.ch/media/0_vd0vs79s

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Ontological neighbourhood

Information engineering

Machine learning: Unsupervised learning

Business

Business administration: Data management

Related lectures (32)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.