Personne

Guillaume Lathoud

Cette personne n’est plus à l’EPFL

Publications associées (53)

Spatio-temporal analysis of spontaneous speech with microphone arrays

Guillaume Lathoud

Accurate detection, localization and tracking of multiple moving speakers permits a wide spectrum of applications. Techniques are required that are versatile, robust to environmental variations, and not constraining for non-technical end-users. Based on di ...
EPFL2007

Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers

Jean-Marc Odobez, Guillaume Lathoud

Distant microphones permit to process spontaneous multi-party speech with very little constraints on speakers, as opposed to close-talking microphones. Minimizing the constraints on speakers permits a large diversity of applications, including meeting summ ...
2007

Observations on Multi-Band Asynchrony in Distant Speech Recordings

Guillaume Lathoud

Whenever the speech signal is captured by a microphone distant from the user, the acoustic response of the room introduces significant distortions. To remove these distortions from the signal, solutions exist that greatly improve the ASR performance (what ...
IDIAP2006

Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays

Guillaume Lathoud

Accurate detection, localization and tracking of multiple moving speakers permits a wide spectrum of applications. Techniques are required that are versatile, robust to environmental variations, and not constraining for non-technical end-users. Based on di ...
IDIAP2006

Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays

Guillaume Lathoud

Accurate detection, localization and tracking of multiple moving speakers permits a wide spectrum of applications. Techniques are required that are versatile, robust to environmental variations, and not constraining for non-technical end-users. Based on di ...
École Polytechnique Fédérale de Lausanne2006

Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels

Hervé Bourlard, Guillaume Lathoud

This paper addresses several issues of classical spectral subtraction methods with respect to the automatic speech recognition task in noisy environments. The main contributions of this paper are twofold. First, a channel normalization method is proposed t ...
IDIAP2006

Further Applications of Sector-Based Detection and Short-Term Clustering

Guillaume Lathoud

This paper presents an effective implementation of detection-localization of multiple speech sources with microphone arrays. In particular, the Scaled Conjugate Gradient descent is used for fast and precise localization, within a pre-detected volume of spa ...
IDIAP2006

Audio-visual probabilistic tracking of multiple speakers in meetings

Daniel Gatica-Perez, Jean-Marc Odobez, Guillaume Lathoud

Tracking speakers in multiparty conversations constitutes a fundamental task for automatic meeting analysis. In this paper, we present a probabilistic approach to jointly track the location and speaking activity of multiple speakers in a multisensor meetin ...
2006

Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Guillaume Lathoud

Adaptation control of beamforming interference cancellation techniques is investigated for in-car speech acquisition. Two efficient adaptation control methods are proposed that avoid target cancellation. The ``implicit'' method varies the step-size continu ...
2006

Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays

Hervé Bourlard, Guillaume Lathoud

Detection is usually done by comparing some criterion to a threshold. It is often desirable to keep a performance metric such as False Alarm Rate constant across conditions. Using training data to select the threshold may lead to suboptimal results on test ...
2006

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.