Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
This paper presents a novel probabilistic framework for localizing multiple speakers with a microphone array. In this framework, the generalized cross correlation function (GCC) of each microphone pair is interpreted as a probability distribution of the ti ...
This work aims at investigating the automatic recognition of speaker role in meeting conversations from the AMI corpus. Two types of roles are considered: formal roles, fixed over the meeting duration and recognized at recording level, and social roles rel ...
This paper presents a novel probabilistic framework for localizing multiple speakers with a microphone array. In this framework, the generalized cross correlation function (GCC) of each microphone pair is interpreted as a probability distribution of the ti ...
Sound field reproduction using Wave Field Synthesis has been so far limited to the positioning of virtual sources and listeners in the horizontal plane only although the underlying formulation (Kirchhoff-Helmholtz) describes the reproduction of 3 dimension ...
Verband Deutscher Tonmeister and ETI, University of Music Detmold2011
Super-directional loudspeaker arrays can be used to achieve high directivity in a limited low-frequency range. As opposed to microphone arrays, the distance between the loudspeakers has to be relatively large, resulting in aliasing starting at relatively l ...
Accurate detection, localization and tracking of multiple moving speakers permits a wide spectrum of applications. Techniques are required that are versatile, robust to environmental variations, and not constraining for non-technical end-users. Based on di ...
A sound field on a line or in a plane has an effectively limited spatial bandwidth determined by the temporal frequency. Similar can be said for sound fields from far-field sources when analyzed on circular and spherical apertures. Namely, for a given freq ...
Sound field reproduction using Wave Field Synthesis has been so far limited to the positioning of virtual sources and listeners in the horizontal plane only although the underlying formulation (Kirchhoff-Helmholtz) describes the reproduction of 3 dimensional ...
We cast the under-determined convolutive speech separation as sparse approximation of the spatial spectra of the mixing sources. In this framework we compare and contrast the major practical algorithms for structured sparse recovery of speech signal. Speci ...
We cast the under-determined convolutive speech separation as sparse approximation of the spatial spectra of the mixing sources. In this framework we compare and contrast the major practical algorithms for structured sparse recovery of speech signal. Speci ...