Publication

Using more informative posterior probabilities for speech recognition

Related publications (37)

Acoustic Models for Posterior Features in Speech Recognition

Guillermo Aradilla

In this thesis, we investigate the use of posterior probabilities of sub-word units directly as input features for automatic speech recognition (ASR). These posteriors, estimated from data-driven methods, display some favourable properties such as increase ...

Ecole Polytechnique Fédérale de Lausanne, 2008

Acoustic Models for Posterior Features in Speech Recognition

Guillermo Aradilla

Idiap, 2008

In-Context Phone Posteriors as Complementary Features for Tandem ASR

Hervé Bourlard, Hamed Ketabdar

In this paper, we present a method for integrating possible prior knowledge (such as phonetic and lexical knowledge), as well as acoustic context (e.g., the whole utterance) in the phone posterior estimation, and we propose to use the obtained posteriors a ...

2008

Enhancing posterior based speech recognition systems

Hamed Ketabdar

The use of local phoneme posterior probabilities has been increasingly explored for improving speech recognition systems. Hybrid hidden Markov model / artificial neural network (HMM/ANN) and Tandem are the most successful examples of such systems. In this ...

EPFL, 2008

Enhancing posterior based speech recognition systems

Hamed Ketabdar

Ecole Polytechnique Fédérale de Lausanne, 2008

Enhanced Phone Posteriors for Improving Speech Recognition Systems

Hervé Bourlard, Hamed Ketabdar

Using phone posterior probabilities has been increasingly explored for improving automatic speech recognition (ASR) systems. In this paper, we propose two approaches for hierarchically enhancing these phone posteriors, by integrating long acoustic context, ...

IDIAP, 2008

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of the speech signal than parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

2006

Map-matching for pedestrians via Bayesian inference

Michel Bierlaire, Bertrand Merminod

A navigation process is to start from a known (initial) position and to ensure a continued localisation of the user during the movement. Consider a pedestrian navigation system which contains a GPS receiver and a set of inertial sensors connected with the ...

2006

Using more informative posterior probabilities for speech recognition

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial investigations towards boosting posterior probability based speech recognition systems by estimating more informative posteriors taking into account acoustic context (e.g., the whole utterance), as well as possible prior i ...

IDIAP, 2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial results towards boosting posterior based speech recognition systems by estimating more informative posteriors using multiple streams of features and taking into account acoustic context (e.g., as available in the whole utt ...

2005