Publication

On Confusions in a Phoneme Recognizer

Related publications (41)

Modelling Auxiliary Features in Tandem Systems

Hervé Bourlard, Shajith Ikbal

Tandem systems transform the cepstral features into posterior probabilities of subword units using artificial neural networks (ANNs), which are processed to form input features for conventional speech recognition systems. They have been shown to perform be ...

2004

Modelling Auxiliary Features in Tandem Systems

Hervé Bourlard, Shajith Ikbal

IDIAP, 2004

Towards using hierarchical posteriors for flexible automatic speech recognition systems

Hervé Bourlard, Samy Bengio, Bertrand Mesot

Local state (or phone) posterior probabilities are often investigated as local classifiers (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., "Tandem") towards improved speech recognition systems. In this paper, we present initial ...

IDIAP, 2004

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition

Hervé Bourlard, Samy Bengio

Standard ASR systems typically use phoneme as the subword units. Preliminary studies have shown that the performance of the ASR system could be improved by using grapheme as additional subword units. In this paper, we investigate such a system where the wo ...

IDIAP, 2003

On the Combination of Speech and Speaker Recognition

Hervé Bourlard

This paper investigates an approach that maximizes the joint posterior probability of the pronounced word and the speaker identity given the observed data. This probability can be expressed as a product of the posterior probability of the pronounced word ...

IDIAP, 2003

On the Combination of Speech and Speaker Recognition

Hervé Bourlard

2003

Using posterior probabilities for speech/music discrimination

Automatic speech/music discrimination has been gaining importance recently, for example when large multimedia documents have to be processed by an ASR system, or for indexing and retrieval of such documents. This work presents using outputs of a speech r ...

IDIAP, 2001

Multi-stream adaptive evidence combination for noise robust ASR

Hervé Bourlard, Astrid Hagen

In this paper we develop different mathematical models in the framework of the multi-stream paradigm for noise robust ASR, and discuss their close relationship with human speech perception. Largely inspired by Fletcher's "product-of-errors" rule in psychoa ...

NORTH-HOLLAND, 2001

Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR

Hervé Bourlard, Astrid Hagen

In this paper, we present and investigate a new method for subband-based Automatic Speech Recognition (ASR) which approximates the ideal 'full combination' approach which is itself often not practical to realize. The 'full combination' approach consists of ...

1999

Multi-stream adaptive evidence combination for noise robust ASR

Hervé Bourlard, Astrid Hagen

IDIAP, 1999