ASAP: a Dataset of Aligned Scores and Performances for Piano Transcription
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
In this paper, we introduce our recent studies on human perception in audio event classification. In particular, the pre-trained model VGGish is used as feature extractor to process audio data, and DenseNet is trained by and used as feature extractor for o ...
New materialism considers that the world and its histories are produced by a range of material forces that extend from the physical and the biological to the psychological, social and cultural. In recognizing that heritage is not held in objects alone, new ...
Acoustical knee health assessment has long promised an alternative to clinically available medical imaging tools, but this modality has yet to be adopted in medical practice. The field is currently led by machine learning models processing acoustical featu ...
With the increasing amount of video being consumed by people daily, there is a danger of the rise in maliciously modified video content (i.e., 'fake news') that could be used to damage innocent people or to impose a certain agenda, e.g., meddle in election ...
Concept maps can be used as generative assessment tools to identify changes in learner’s understanding. However, concept map analysis usually only focuses on the final product. This case study used a talk aloud protocol to study and compare the concept map ...
This paper introduces a novel approach for extracting speaker embeddings from audio mixtures of multiple overlapping voices. This approach is based on a multi-task neural network. The network first extracts a latent feature for each direction. This feature ...
ISCA-INT SPEECH COMMUNICATION ASSOC2021
, , , ,
Understanding the influence of running-induced acute fatigue on the homeostasis of the body is essential to mitigate the adverse effects and optimize positive adaptations to training. Fatigue is a multifactorial phenomenon, which influences biomechanical, ...
FRONTIERS MEDIA SA2022
We present a novel method for the compensation of long duration data loss in audio signals, in particular music. The concealment of such signal defects is based on a graph that encodes signal structure in terms of time-persistent spectral similarity. A sui ...
Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudinal TV show data s ...
On most musical instruments, especially on the guitar, it is possible to play the same note or chord in multiple ways. In this project, we develop a simple audio-based method to estimate the fingering of a note played on an acoustic guitar from a recording ...