Publication

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition

Publications associées (33)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Efficient integration of automated speech recognition in the framework of dialogue-based vocal systems

In this work, we propose different strategies for efficiently integrating an automated speech recognition module in the framework of a dialogue-based vocal system. The aim is the study of different ways leading to the improvement of the quality and robustn ...

EPFL2005

Improving Speech Recognition Using a Data-Driven Approach

Hervé Bourlard, Guillermo Aradilla

In this paper, we investigate the possibility of enhancing state-of-the-art HMM-based speech recognition systems using data-driven techniques, where whole set of training utterances is used as reference models and recognition is then performed through the ...

IDIAP2005

Improving Speech Recognition Using a Data-Driven Approach

Hervé Bourlard, Guillermo Aradilla

2005

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

In this paper, we introduce a new noise robust representation of speech signal obtained by locating points of potential importance in the spectrogram, and parameterizing the activity of time-frequency pattern around those points. These features are referre ...

2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

IDIAP2004

Acoustic Echo Cancellation for Human-Robot Communications

Jérôme Berclaz

This master thesis presents a new efficient method of acoustic echo cancellation targeted at speech recognition for robots. The proposed algorithm features a new double-talk detector, an enhanced initialization and a new noise estimation method. The DTD al ...

2004

Using pitch frequency information in speech recognition

Hervé Bourlard

Automatic Speech Recognition systems typically use smoothed spectral features as acoustic observations. In recent studies, it has been shown that complementing these standard features with pitch frequency could improve the system performance of the system. ...

IDIAP2003

Using pitch frequency information in speech recognition

Hervé Bourlard

2003

Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks

Hervé Bourlard

The challenge of automatic speech recognition (ASR) increases when speaker variability is encountered. Being able to automatically use different acoustic models according to speaker type might help to increase the robustness of ASR. We present a system tha ...

IDIAP2000

Reconnaissance et transformation de locuteurs

Dominique Genoud

This PhD thesis tries to understand how to analyse, decompose, model and transform the vocal identity of a human when seen through an automatic speaker recognition application. It starts with an introduction explaining the properties of the speech signal a ...

EPFL1999