Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays
Graph Chatbot
Chattez avec Graph Search
Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.
AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.
We propose a novel formulation of the generalized cross correlation with phase transform (GCC-PHAT) for a pair of microphones in diffuse sound field. This formulation elucidates the links between the microphone distances and the GCC-PHAT output. Hence, it ...
Speaker diarization is the task of identifying “who spoke when” in an audio stream containing multiple speakers. This is an unsupervised task as there is no a priori information about the speakers. Diagnostical studies on state-of-the-art diarization syste ...
This paper introduces a non-linear vector-based feature mapping approach to extract robust features for au- tomatic speech recognition (ASR) of overlapping speech using a microphone array. We explore different configurations and additional sources of infor ...
Depth imaging is commonly based on light. For example, LIDAR and Kinect use infrared light, while stereo cameras use visible light. These systems require hardware operating at high sampling frequencies, precise calibration, and they dissipate significant p ...
A compelling method to calibrate the positions of microphones in an array is with sources at unknown locations. Remarkably, it is possible to reconstruct the locations of both the sources and the receivers, if their number is larger than some prescribed mi ...
IEEE2014
In architecture, space is traditionally understood as a cumulus of forms, colors and textures that form a conceived area, or-in other words-architectural space design is mainly achieved by following visual concepts. The human experience of space, however, ...
Springer2015
, ,
Sound field reproduction using Wave Field Synthesis has been so far limited to the positioning of virtual sources and listeners in the horizontal plane only although the underlying formulation (Kirchhoff-Helmholtz) describes the reproduction of 3 dimensional ...
Société Française d'Acoustique2012
, , , ,
A novel localization approach is proposed in order to find the position of an individual source using recordings of a single microphone in a reverberant enclosure. The multipath propagation is modeled by multiple virtual microphones as images of the actual ...
We propose a novel method for single-channel microphone localization inside a known room. Unlike other approaches, we take advantage of the room reverberation, which enables us to use only a single fixed loudspeaker to localize the microphone. Our method u ...
IEEE2014
, , ,
This paper proposes a speech localization framework based on model-based sparse recovery. We compare and contrast the computational sparse optimization methods incorporating harmonicity and block structures as well as autoregressive dependencies underlying ...