Publication

A Method for Record Linkage with Sparse Historical Data

Publications associées (39)

From Archival Sources to Structured Historical Information: Annotating and Exploring the "Accordi dei Garzoni"

Frédéric Kaplan, Maud Ehrmann, Orlin Biserov Topalov

If automatic document processing techniques have achieved a certain maturity for present time documents, the transformation of hand-written documents into well-represented, structured and connected data which can satisfactorily be used for historical study ...
Routledge, Taylor & Francis Group2023

Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework

Daniel Gatica-Perez

This paper examines how the European press dealt with the no-vax reactions against the Covid-19 vaccine and the dis- and misinformation associated with this movement. Using a curated dataset of 1786 articles from 19 European newspapers on the anti-vaccine ...
ASSOC COMPUTING MACHINERY2023

Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents

Maud Ehrmann, Matteo Romanello

We present the HIPE-2022 shared task on named entity processing in multilingual historical documents. Following the success of the first CLEF-HIPE-2020 evaluation lab, this edition confronts systems with the challenges of dealing with more languages, learn ...
Springer International Publishing2022

Strong Heuristics for Named Entity Linking

Robert West, Akhil Arora, Marko Culjak, Andreas Oliver Spitz

Named entity linking (NEL) in news is a challenging endeavour due to the frequency of unseen and emerging entities, which necessitates the use of unsupervised or zero-shot methods. However, such methods tend to come with caveats, such as no integration of ...
ASSOC COMPUTATIONAL LINGUISTICS-ACL2022

A Two-Step Approach To Leverage Contextual Data: Speech Recognition In Air-Traffic Communications

Petr Motlicek, Juan Pablo Zuluaga Gomez, Amrutha Prasad

Automatic Speech Recognition (ASR), as the assistance of speech communication between pilots and air-traffic controllers, can significantly reduce the complexity of the task and increase the reliability of transmitted information. ASR application can lead ...
IEEE2022

HIPE-2022 Shared Task Named Entity Datasets

Maud Ehrmann, Matteo Romanello

HIPE-2022 datasets used for the HIPE 2022 shared task on named entity recognition and classification (NERC) and entity linking (EL) in multilingual historical documents. HIPE-2022 datasets are based on six primary datasets assembled and prepared for the sh ...
2022

A Posthumous Rivalry: On Borelli and Viviani's Relationship between the Accademia del Cimento and en Eighteenth-Century Controversy

Simon François Dumas Primbault

Between around 1656 and the late 1660s, Giovanni Alfonso Borelli and Vincenzio Viviani, self-proclaimed last disciple of Galileo, collaborated on a host of math- ematical, physico-mathematical, and experimental problems, notably within the Acca- demia del ...
2022

Low-Rank Subspaces for Unsupervised Entity Linking

Robert West, Akhil Arora, Alberto Garcia Duran

Entity linking is an important problem with many applications. Most previous solutions were designed for settings where annotated training data is available, which is, however, not the case in numerous domains. We propose a light-weight and scalable entity ...
ASSOC COMPUTATIONAL LINGUISTICS-ACL2021

Discourse Phenomena in Machine Translation

Lesly Sadiht Miculicich Werlen

Machine Translation (MT) has made considerable progress in the past two decades, particularly after the introduction of neural network models (NMT). During this time, the research community has mostly focused on modeling and evaluating MT systems at the se ...
EPFL2021

CLEF-HIPE-2020 Shared Task Named Entity Datasets

Maud Ehrmann, Matteo Romanello

CLEF-HIPE-2020 (Identifying Historical People, Places and other Entities) is a evaluation campaign on named entity processing on historical newspapers in French, German and English, which was organized in the context of the impresso project and run as a CL ...
2020

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.