Unit

Digital Humanities Laboratory

Laboratory
Related publications (350)

Querying the Digital Archive of Science: Distant Reading, Semantic Modelling and Representation of Knowledge

Alina Volynskaya

The archive of science is a place where scientific practices are sedimented in the form of drafts, protocols of rejected hypotheses and failed experiments, obsolete instruments, outdated visualizations and other residues. Today, just as science goes more a ...
EPFL2024

Post-correction of Historical Text Transcripts with Large Language Models: An Exploratory Study

Frédéric Kaplan, Maud Ehrmann, Matteo Romanello, Sven-Nicolas Yoann Najem, Emanuela Boros

The quality of automatic transcription of heritage documents, whether from printed, manuscripts or audio sources, has a decisive impact on the ability to search and process historical texts. Although significant progress has been made in text recognition ( ...
The Association for Computational Linguistics2024

Actors and Objects of Heritage Preservation. The Singular and Dual Condition of Historical Architectural Archives

Salvatore Aprea, Barbara Galimberti

In a society that recognizes the urgency of safeguarding the environment and drastically limiting land transformations and energy-intensive activities like constructing new buildings, the protection of architectural and environmental heritage is no longer ...
2024

The Zenodo communities: visibility and FAIRness of your dataset. Example at the EPFL

Alain Borel

Communities are shared areas on the Zenodo platform where projects, institutions, domains, and conferences can curate and manage their research outputs. An EPFL community https://zenodo.org/communities/epfl was created in 2013, mainly as a light-weight sol ...
2024

Transformer Models for Vision

Jean-Baptiste Francis Marie Juliette Cordonnier

The recent developments of deep learning cover a wide variety of tasks such as image classification, text translation, playing go, and folding proteins.All these successful methods depend on a gradient-based learning algorithm to train a model on massive a ...
EPFL2023

impresso Text Reuse at Scale. An interface for the exploration of text reuse data in semantically enriched historical newspapers

Maud Ehrmann, Matteo Romanello

Text Reuse reveals meaningful reiterations of text in large corpora. Humanities researchers use text reuse to study, e.g., the posterior reception of influential texts or to reveal evolving publication practices of historical media. This research is often ...
2023

The Facets of Intangible Heritage in Southern Chinese Martial Arts: Applying a Knowledge-Driven Cultural Contact Detection Approach

Yumeng Hou

Investigating the intangible nature of a cultural domain can take multiple forms, addressing for example the aesthetic, epistemic and social dimensions of its phenomenology. The context of Southern Chinese martial arts is of particular significance as it c ...
2023

Lausanne Historical Censuses Dataset HTR 35k

Lucas Arnaud André Rappo, Rémi Guillaume Petitpierre, Marion Kramer

This training dataset includes a total of 34,913 manually transcribed text segments. It is dedicated to the handwritten text recognition (HTR) of historical sources, typically tabular records, such as censuses. This dataset is based on a sample of 83 pages ...
Zenodo2023

From Archival Sources to Structured Historical Information: Annotating and Exploring the "Accordi dei Garzoni"

Frédéric Kaplan, Maud Ehrmann, Orlin Biserov Topalov

If automatic document processing techniques have achieved a certain maturity for present time documents, the transformation of hand-written documents into well-represented, structured and connected data which can satisfactorily be used for historical study ...
Routledge, Taylor & Francis Group2023

Where Did the News Come From? Detection of News Agency Releases in Historical Newspapers

Lea Marxen

Since their beginnings in the 1830s and 1840s, news agencies have played an important role in the national and international news market, aiming to deliver news as fast and as reliable as possible. While we know that newspapers have been using agency conte ...
2023

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.