Publications related to Privacy-Sensitive Audio Features for Speech/Nonspeech Detection

Improving Speech Recognition Using a Data-Driven Approach

In this paper, we investigate the possibility of enhancing state-of-the-art HMM-based speech recognition systems using data-driven techniques, where the whole set of training utterances is used as reference models and recognition is then performed through the ...

IDIAP, 2005

Improving Speech Recognition Using a Data-Driven Approach

Hervé Bourlard, Guillermo Aradilla

In this paper, we investigate the possibility of enhancing state-of-the-art HMM-based speech recognition systems using data-driven techniques, where the whole set of training utterances is used as reference models and recognition is then performed through the ...

2005

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

IDIAP, 2005

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

In this paper, we introduce a new noise robust representation of speech signal obtained by locating points of potential importance in the spectrogram, and parameterizing the activity of time-frequency pattern around those points. These features are referre ...

2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

In this paper, we introduce a new noise robust representation of speech signal obtained by locating points of potential importance in the spectrogram, and parameterizing the activity of time-frequency pattern around those points. These features are referre ...

IDIAP, 2004

Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable

In current automatic speech recognition (ASR) systems, the energy is not used as part of the feature vector in spite of being a fundamental feature in the speech signal. The noise inherent in its estimation degrades the system performance. In this report w ...

IDIAP, 2003

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL, 2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

Ecole Polytechnique Federale de Lausanne, 2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

IDIAP, 2003

Robust Speech Recognition and Feature Extraction Using HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber, Shajith Ikbal

This paper presents the theoretical basis and preliminary experimental results of a new HMM model, referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated ...

2003
