Publication

Using Posterior-Based Features in Template Matching for Speech Recognition

Related publications (34)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Quantum fluctuations of one-dimensional free fermions and Fisher-Hartwig formula for Toeplitz determinants

We revisit the problem of finding the probability distribution of a fermionic number of one-dimensional spinless free fermions on a segment of a given length. The generating function for this probability distribution can be expressed as a determinant of a ...

2011

AMIDA/Klewel Mini-Project

Petr Motlicek, Philip Neil Garner, Vincent Bozzo

The goal of the AMIDA mini-project is to transfer some of the technologies developed within the AMIDA project to be used by a Klewel retrieval system. More specifically, the main focus is to develop a speech-to-text application based on the AMIDA Automatic ...

Idiap2010

Audio-visual reliability estimates using stream entropy for speech recognition

Jean-Philippe Thiran, Mihai Gurban

We present a method for multimodal fusion based on the estimated reliability of each individual modality. Our method uses an information theoretic measure, the entropy derived from the state probability distribution for each stream, as an estimate of relia ...

2009

Acoustic models for posterior features in speech recognition

Guillermo Aradilla

In this thesis, we investigate the use of posterior probabilities of sub-word units directly as input features for automatic speech recognition (ASR). These posteriors, estimated from data-driven methods, display some favourable properties such as increase ...

EPFL2008

Acoustic Models for Posterior Features in Speech Recognition

Guillermo Aradilla

Ecole Polytechnique Fédérale de Lausanne2008

Acoustic Models for Posterior Features in Speech Recognition

Guillermo Aradilla

Idiap2008

Stationary Features and Cat Detection

François Fleuret

Most discriminative techniques for detecting instances from object categories in still images consist of looping over a partition of a pose space with dedicated binary classifiers. The efficiency of this strategy for a complex pose, i.e., for fine-grained ...

2008

Using entropy as a stream reliability estimate for audio-visual speech recognition

Jean-Philippe Thiran, Mihai Gurban

We present a method for dynamically integrating audio-visual information for speech recognition, based on the estimated reliability of the audio and visual streams. Our method uses an information theoretic measure, the entropy derived from the state probab ...

2008

Stationary Features and Cat Detection

François Fleuret

IDIAP2007

Using Posterior-Based Features in Template Matching for Speech Recognition

Hervé Bourlard, Guillermo Aradilla

Given the availability of large speech corpora, as well as the increasing of memory and computational resources, the use of template matching approaches for automatic speech recognition (ASR) have recently attracted new attention. In such template-based ap ...

2006