Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Viewers of 360-degree videos are provided with both visual modality to characterize their surrounding views and audio modality to indicate the sound direction. Though both modalities are important for saliency prediction, little work has been done by joint ...
We present an extensive evaluation of a wide variety of promising design patterns for automated deep-learning (AutoDL) methods, organized according to the problem categories of the 2019 AutoDL challenges, which set the task of optimizing both model accura ...
In this paper we present Aligned Scores and Performances (ASAP): a new dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.The scores are provided as paired MusicXML files and quantized ...
New materialism considers that the world and its histories are produced by a range of material forces that extend from the physical and the biological to the psychological, social and cultural. In recognizing that heritage is not held in objects alone, new ...
Understanding the influence of running-induced acute fatigue on the homeostasis of the body is essential to mitigate the adverse effects and optimize positive adaptations to training. Fatigue is a multifactorial phenomenon, which influences biomechanical, ...
This paper introduces a novel approach for extracting speaker embeddings from audio mixtures of multiple overlapping voices. This approach is based on a multi-task neural network. The network first extracts a latent feature for each direction. This feature ...
In this paper, we introduce our recent studies on human perception in audio event classification. In particular, the pre-trained model VGGish is used as feature extractor to process audio data, and DenseNet is trained by and used as feature extractor for o ...
On most musical instruments, especially on the guitar, it is possible to play the same note or chord in multiple ways. In this project, we develop a simple audio-based method to estimate the fingering of a note played on an acoustic guitar from a recording ...
Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudinal TV show data s ...
Concept maps can be used as generative assessment tools to identify changes in learner’s understanding. However, concept map analysis usually only focuses on the final product. This case study used a talk aloud protocol to study and compare the concept map ...