Visual Speaker Localization Aided by Acoustic Models
Related publications (33)
In architecture, space is traditionally understood as a cumulus of forms, colors and textures that form a conceived area, or, in other words, architectural space design is mainly achieved by following visual concepts. The human experience of space, however, ...
We cast the under-determined convolutive speech separation as sparse approximation of the spatial spectra of the mixing sources. In this framework we compare and contrast the major practical algorithms for structured sparse recovery of speech signal. Speci ...
Depth imaging is commonly based on light. For example, LIDAR and Kinect use infrared light, while stereo cameras use visible light. These systems require hardware operating at high sampling frequencies, precise calibration, and they dissipate significant p ...
A sound field on a line or in a plane has an effectively limited spatial bandwidth determined by the temporal frequency. The same can be said for sound fields from far-field sources when analyzed on circular and spherical apertures. Namely, for a given freq ...
A compelling method to calibrate the positions of microphones in an array is with sources at unknown locations. Remarkably, it is possible to reconstruct the locations of both the sources and the receivers, if their number is larger than some prescribed mi ...
Sound field reproduction using Wave Field Synthesis has been so far limited to the positioning of virtual sources and listeners in the horizontal plane only although the underlying formulation (Kirchhoff-Helmholtz) describes the reproduction of 3 dimensional ...
We propose a novel method for single-channel microphone localization inside a known room. Unlike other approaches, we take advantage of the room reverberation, which enables us to use only a single fixed loudspeaker to localize the microphone. Our method u ...
Sound field reproduction using Wave Field Synthesis has been so far limited to the positioning of virtual sources and listeners in the horizontal plane only although the underlying formulation (Kirchhoff-Helmholtz) describes the reproduction of 3 dimension ...
Verband Deutscher Tonmeister and ETI, University of Music Detmold, 2011
This paper presents a new method for learning overcomplete dictionaries adapted to efficient joint representation of stereo images. We first formulate a sparse stereo image model where the multi-view correlation is described by local geometric transforms o ...
Institute of Electrical and Electronics Engineers, 2011