Publications related to Sparse Autoencoders for Speech Modeling and Recognition

Effect of Recognition Errors on Text Clustering

This paper presents clustering experiments performed over noisy texts (i.e. texts that have been extracted through an automatic process like character or speech recognition). The effect of recognition errors is investigated by comparing clustering results ...

IDIAP2004

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

IDIAP2003

Robust Speech Recognition and Feature Extraction Using HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber, Shajith Ikbal

This paper presents the theoretical basis and preliminary experimental results of a new HMM model, referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated ...

2003

Using pitch frequency information in speech recognition

Hervé Bourlard

Automatic Speech Recognition systems typically use smoothed spectral features as acoustic observations. In recent studies, it has been shown that complementing these standard features with pitch frequency could improve the system performance of the system. ...

IDIAP2003

Using pitch frequency information in speech recognition

Hervé Bourlard

Automatic Speech Recognition systems typically use smoothed spectral features as acoustic observations. In recent studies, it has been shown that complementing these standard features with pitch frequency could improve the system performance of the system. ...

2003

Evaluation of Formant-Like Features for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This paper investigates possibilities to automatically find a low-dimensional, formant-related physical representation of the speech signal, which is suitable for automatic speech recognition (ASR). This aim is motivated by the fact that formants have been ...

IDIAP2002

Evaluation of Formant-Like Features for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This paper investigates possibilities to automatically find a low-dimensional, formant-related physical representation of the speech signal, which is suitable for automatic speech recognition (ASR). This aim is motivated by the fact that formants have been ...

2002

Speech Processing & Text-Independent Automatic Person Verification

In this communication we first review the human speech production process and feature extraction approaches commonly used in a speaker verification system. Experiments on the telephone speech {NTIMIT} database suggest that the performance degradation of a ...

IDIAP2002

Sparse Autoencoders for Speech Modeling and Recognition

Graph Chatbot

Chat with Graph Search

Effect of Recognition Errors on Text Clustering

HMM mixtures (HMM2) for robust speech recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

Robust Speech Recognition and Feature Extraction Using HMM2

Using pitch frequency information in speech recognition

Using pitch frequency information in speech recognition

Evaluation of Formant-Like Features for ASR

Evaluation of Formant-Like Features for ASR

Speech Processing & Text-Independent Automatic Person Verification

Effect of Recognition Errors on Text Clustering

Using pitch frequency information in speech recognition

HMM mixtures (HMM2) for robust speech recognition

HMM Mixtures (HMM2) for Robust Speech Recognition

Speech Processing & Text-Independent Automatic Person Verification

HMM Mixtures (HMM2) for Robust Speech Recognition

Robust Speech Recognition and Feature Extraction Using HMM2

Evaluation of Formant-Like Features for ASR

Using pitch frequency information in speech recognition

Evaluation of Formant-Like Features for ASR