Publications related to Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition

Robust audio segmentation

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition

Hynek Hermansky

The paper presents an alternative approach to automatic recognition of speech in which each targeted word is classified by a separate binary classifier against all other sounds. No time alignment is done. To build a recognizer for N words, N parallel binar ...

IDIAP2005

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition

Hervé Bourlard, Hynek Hermansky, Hemant Misra, Shajith Ikbal

Methods to improve noise robustness of speech recognition systems often result in degradation of recognition performance for clean speech. Recently proposed Phase AutoCorrelation (PAC) \cite{ikbal03,ikbal03a} based features, showing noticeable improvement ...

2004

Towards using hierarchical posteriors for flexible automatic speech recognition systems

Hervé Bourlard, Samy Bengio, Bertrand Mesot

Local state (or phone) posterior probabilities are often investigated as local classifiers (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., ``Tandem'') towards improved speech recognition systems. In this paper, we present initial ...

IDIAP2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Hervé Bourlard

This paper investigates a new approach to perform simultaneous speech and speaker recognition. The likelihood estimated by a speaker identification system is combined with the posterior probability estimated by the speech recognizer. So, the joint posterio ...

IDIAP2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Hervé Bourlard

This paper investigates a new approach to perform simultaneous speech and speaker recognition. The likelihood estimated by a speaker identification system is combined with the posterior probability estimated by the speech recognizer. So, the joint posterio ...

2004

Modelling Auxiliary Features in Tandem Systems

Hervé Bourlard, Shajith Ikbal

Tandem systems transform the cepstral features into posterior probabilities of subword units using artificial neural networks (ANNs), which are processed to form input features for conventional speech recognition systems. They have been shown to perform be ...

2004

Modelling Auxiliary Features in Tandem Systems

Hervé Bourlard, Shajith Ikbal

Tandem systems transform the cepstral features into posterior probabilities of subword units using artificial neural networks (ANNs), which are processed to form input features for conventional speech recognition systems. They have been shown to perform be ...

IDIAP2004

Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition

Graph Chatbot

Chat with Graph Search

Robust audio segmentation

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition

Robust Audio Segmentation

Robust Audio Segmentation

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition

Towards using hierarchical posteriors for flexible automatic speech recognition systems

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Modelling Auxiliary Features in Tandem Systems

Modelling Auxiliary Features in Tandem Systems

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Robust audio segmentation

Robust Audio Segmentation

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition

Towards using hierarchical posteriors for flexible automatic speech recognition systems

Modelling Auxiliary Features in Tandem Systems

Robust Audio Segmentation

Modelling Auxiliary Features in Tandem Systems