Publication

Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction

Publications associées (37)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Adaptive Beamforming with a Maximum Negentropy Criterion

Philip Neil Garner, Weifeng Li

In this paper, we address an adaptive beamforming application in realistic acoustic conditions. After the position of a speaker is estimated by a speaker tracking system, we construct a subband-domain beamformer in generalized sidelobe canceller (GSC) conf ...

2008

Adaptive Beamforming with a Maximum Negentropy Criterion

Philip Neil Garner, Weifeng Li

\begin{abstract} In this paper, we address an adaptive beamforming application in realistic acoustic conditions. After the position of a speaker is estimated by a speaker tracking system, we construct a subband-domain beamformer in \emph{generalized sidelo ...

IDIAP2008

A multimodal pattern recognition framework for speaker detection

Patricia Besson

Speaker detection is an important component of a speech-based user interface. Audiovisual speaker detection, speech and speaker recognition or speech synthesis for example find multiple applications in human-computer interaction, multimedia content indexin ...

EPFL2007

Adaptive Beamforming with a Minimum Mutual Information Criterion

In this work, we consider an acoustic beamforming application where two speakers are simultaneously active. We construct one subband-domain beamformer in \emph{generalized sidelobe canceller} (GSC) configuration for each source. In contrast to normal pract ...

IDIAP2007

Novel speech processing techniques for robust automatic speech recognition

Vivek Tyagi

The goal of this thesis is to develop and design new feature representations that can improve the automatic speech recognition (ASR) performance in clean as well noisy conditions. One of the main shortcomings of the fixed scale (typically 20-30 ms long ana ...

EPFL2006

Towards using slide information to enhance speech transcription of meetings

Hervé Bourlard, Artem Peregoudov, Alessandro Vinciarelli

In this paper we investigate the possibility of improving the speech recognition performance of meeting recordings by using slides captured during the recording process. The key hypothesis exploited in this work is that both slides and speech carry correla ...

IDIAP2006

Improving Speech Recognition Using a Data-Driven Approach

Hervé Bourlard, Guillermo Aradilla

In this paper, we investigate the possibility of enhancing state-of-the-art HMM-based speech recognition systems using data-driven techniques, where whole set of training utterances is used as reference models and recognition is then performed through the ...

IDIAP2005

Improving Speech Recognition Using a Data-Driven Approach

Hervé Bourlard, Guillermo Aradilla

2005

Noisy Text Categorization

Alessandro Vinciarelli

This work presents categorization experiments performed over noisy texts. By noisy it is meant any text obtained through an extraction process (affected by errors) from media other than digital texts (e.g. transcriptions of speech recordings extracted with ...

2005

Noisy Text Categorization

Alessandro Vinciarelli

IDIAP2004