Publication

Enhancing posterior based speech recognition systems

Related publications (70)

Using more informative posterior probabilities for speech recognition

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial investigations towards boosting posterior probability based speech recognition systems by estimating more informative posteriors taking into account acoustic context (e.g., the whole utterance), as well as possible prior i ...

IDIAP2005

A Discriminative Decoder for the Recognition of Phoneme Sequences

Samy Bengio, David Grangier

In this report, we propose a discriminative decoder for phoneme recognition, i.e. the identification of the uttered phoneme sequence from a speech recording. This task is solved as a 3-step process: a phoneme classifier first classifies each acoustic fram ...

IDIAP2005

Improving Continuous Speech Recognition System Performance with Grapheme Modelling

Hervé Bourlard, Hynek Hermansky, John David Scott Dines

This paper investigates an automatic speech recognition system using context-dependent graphemes as subword units, based on the conventional HMM/GMM system as well as the TANDEM system. Experimental studies conducted on two different continuous speech recognition ...

IDIAP2005

Hierarchical approach for spotting keywords

The paper presents a new approach to spotting a particular sound (keyword) in an acoustic stream. The approach is based on hierarchical processing where equally-sampled posterior probabilities of phoneme classes are estimated first, followed by matched fil ...

IDIAP2005

A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems

Samy Bengio

This paper presents a principled SVM-based speaker verification system. We propose a new framework and a new sequence kernel that can make use of any Mercer kernel at the frame level. An extension of the sequence kernel based on the Max operator is also pro ...

IDIAP2005

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition

Hervé Bourlard, Samy Bengio

Standard ASR systems typically use phonemes as the subword units. Preliminary studies have shown that the performance of an ASR system can be improved by using graphemes as additional subword units. In this paper, we investigate such a system where the wo ...

2004

Phoneme vs Grapheme Based Automatic Speech Recognition

Hervé Bourlard, Hynek Hermansky, John David Scott Dines

In recent literature, different approaches have been proposed to use graphemes as subword units with implicit source of phoneme information for automatic speech recognition. The major advantage of using graphemes as subword units is that the definition of ...

IDIAP2004

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition

Hervé Bourlard, Samy Bengio

IDIAP2003

User-Customized Password HMM Based Speaker Verification

Hervé Bourlard

is presented. The system has no a priori knowledge of passwords. A hybrid HMM/ANN system is used to infer the phonetic transcription of the password. The emission probabilities are then modeled by a multi-Gaussians HMM model. Evaluation experiments, ...

IDIAP2002

User-Customized Password HMM Based Speaker Verification

Hervé Bourlard

2002