Related publications (15)

Post-correction of Historical Text Transcripts with Large Language Models: An Exploratory Study

Frédéric Kaplan, Maud Ehrmann, Matteo Romanello, Sven-Nicolas Yoann Najem, Emanuela Boros

The quality of automatic transcription of heritage documents, whether from printed, manuscript or audio sources, has a decisive impact on the ability to search and process historical texts. Although significant progress has been made in text recognition ( ...
The Association for Computational Linguistics, 2024

Lausanne Historical Censuses Dataset HTR 35k

Lucas Arnaud André Rappo, Rémi Guillaume Petitpierre, Marion Kramer

This training dataset includes a total of 34,913 manually transcribed text segments. It is dedicated to the handwritten text recognition (HTR) of historical sources, typically tabular records, such as censuses. This dataset is based on a sample of 83 pages ...
Zenodo, 2023

1805-1898 Census Records of Lausanne : a Long Digital Dataset for Demographic History

Isabella Di Lenardo, Lucas Arnaud André Rappo, Rémi Guillaume Petitpierre, Marion Kramer

This historical dataset stems from the project of automatic extraction of 72 census records of Lausanne, Switzerland. The complete dataset covers a century of historical demography in Lausanne (1805-1898), which corresponds to 18,831 pages, and nearly 6 mi ...
Zenodo, 2023

Digitised Newspapers – A New Eldorado for Historians? Reflections on Tools, Methods and Epistemology

Maud Ehrmann, Frédéric Clavert

The application of digital technologies to historical newspapers has changed the research landscape historians were used to. An Eldorado? Despite undeniable advantages, the new digital affordance of historical newspapers also transforms research practices ...
De Gruyter, 2022

Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers

Frédéric Kaplan, Maud Ehrmann, Sofia Ares Oliveira, Raphaël Barman

The massive amounts of digitized historical documents acquired over the last decades naturally lend themselves to automatic processing and exploration. Research work seeking to automatically process facsimiles and extract information thereby are multiplyin ...
2021

Comparing human and machine performances in transcribing 18th century handwritten Venetian script

Frédéric Kaplan, Sofia Ares Oliveira

Automatic transcription of handwritten texts has made important progress in recent years. This increase in performance, essentially due to new architectures combining convolutional neural networks with recurrent neural networks, opens new avenues for ...
2018

Comparing human and machine performances in transcribing 18th century handwritten Venetian script

Sofia Ares Oliveira

Automatic transcription of handwritten texts has made important progress in recent years. This increase in performance, essentially due to new architectures combining convolutional neural networks with recurrent neural networks, opens new avenues for ...
2018

Automatic social role recognition and its application in structuring multiparty interactions

Ashtosh Sapru

Automatic processing of multiparty interactions is a research domain with important applications in content browsing, summarization and information retrieval. In recent years, several works have been devoted to finding regular patterns which speakers exhibit ...
EPFL, 2015

Scattering coefficients for the automatic diagnosis of perinatal asphyxia

Ivan Benjamin Baeriswyl

Perinatal asphyxia causes the death of about 1.2 million newborn infants every year. It is one of the top three causes of infant mortality in developing countries. The current way of determining the occurrence of perinatal asphyxia is by the analysis of a ...
2015

OCR Based Slide Retrieval

Jean-Marc Odobez, Alessandro Vinciarelli, Nabil Daddaoua

This work addresses the problem of acquiring, indexing and retrieving slides in the context of automatic oral presentation processing. Since the most suitable acquisition technique, in such a context, is the use of a framegrabber (a device capturing as ima ...
IDIAP, 2005
