Publications associées à Robust overlapping speech recognition based on neural networks

Text Detection and Recognition in Images and Videos

Hervé Bourlard, Jean-Marc Odobez, Datong Chen

Text embedded in images and videos represents a rich source of information for content-based indexing and retrieval applications. In this paper, we present a new method for localizing and recognizing text in complex images and videos. Text localization is ...

2004

Entropy Based Combination of Tandem Representations for Noise Robust ASR

Hervé Bourlard, Hynek Hermansky, Hemant Misra, Shajith Ikbal

In this paper, we present an entropy based method to combine tandem representations of the recently proposed Phase AutoCorrelation (PAC) based features and Mel-Frequency Cepstral Coefficients (MFCC) features. PAC based features, derived from a nonlinear tr ...

2004

Entropy Based Combination of Tandem Representations for Noise Robust ASR

Hervé Bourlard, Hynek Hermansky, Hemant Misra, Shajith Ikbal

In this paper, we present an entropy based method to combine tandem representations of the recently proposed Phase AutoCorrelation (PAC) based features and Mel-Frequency Cepstral Coefficients (MFCC) features. PAC based features, derived from a nonlinear tr ...

IDIAP2004

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

IDIAP2003

Low cost duration modelling for noise robust speech recognition

Hervé Bourlard

State transition matrices as used in standard HMM decoders have two widely perceived limitations. One is that the implicit Geometric state duration distributions which they model do not accurately reflect true duration distributions. The other is that they ...

2002

Low cost duration modelling for noise robust speech recognition

Hervé Bourlard

State transition matrices as used in standard HMM decoders have two widely perceived limitations. One is that the implicit Geometric state duration distributions which they model do not accurately reflect true duration distributions. The other is that they ...

IDIAP2002

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach

Hervé Bourlard

Traditional microphone array speech recognition systems simply recognise the enhanced output of the array. As the level of signal enhancement depends on the number of microphones, such systems do not achieve acceptable speech recognition performance for ar ...

IDIAP2002

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach

Hervé Bourlard

Traditional microphone array speech recognition systems simply recognise the enhanced output of the array. As the level of signal enhancement depends on the number of microphones, such systems do not achieve acceptable speech recognition performance for ar ...

2002

Robust overlapping speech recognition based on neural networks

Graph Chatbot

Chattez avec Graph Search

Text Detection and Recognition in Images and Videos

Entropy Based Combination of Tandem Representations for Noise Robust ASR

Entropy Based Combination of Tandem Representations for Noise Robust ASR

HMM mixtures (HMM2) for robust speech recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

Low cost duration modelling for noise robust speech recognition

Low cost duration modelling for noise robust speech recognition

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach

Entropy Based Combination of Tandem Representations for Noise Robust ASR

HMM mixtures (HMM2) for robust speech recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

Low cost duration modelling for noise robust speech recognition

Low cost duration modelling for noise robust speech recognition

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach

Text Detection and Recognition in Images and Videos

Entropy Based Combination of Tandem Representations for Noise Robust ASR

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach