Publication

Posterior-based Sparse Representation for Automatic Speech Recognition

Related publications (52)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Fast keyword detection with sparse time-frequency models

Pascal Frossard, Olivier Verscheure, Effrosyni Kokiopoulou

We address the problem of keyword spotting in continuous speech streams when training and testing conditions can be different. We propose a keyword spotting algorithm based on sparse representation of speech signals in a time-frequency feature space. The t ...

2008

Correcting Confusion Matrices for Phone Recognizers

Modern speech recognition has many ways of quantifying the misrecognitions a speech recognizer makes. The errors in modern speech recognition makes extensive use of the Levenshtein algorithm to find the distance between the labeled target and the recognize ...

IDIAP2007

Efficient integration of automated speech recognition in the framework of dialogue-based vocal systems

In this work, we propose different strategies for efficiently integrating an automated speech recognition module in the framework of a dialogue-based vocal system. The aim is the study of different ways leading to the improvement of the quality and robustn ...

EPFL2005

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

IDIAP2003

TODE: A Decoder for Continuous Speech Recognition

Darren Moore

This document describes a new continuous speech decoder, TODE, which is compatible with the Torch machine learning software library. A brief theory of speech recognition is presented followed by a detailed description of the architecture of TODE and the co ...

IDIAP2002

HMM2- Extraction of Formant Features and their Use for Robust ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

As recently introduced, an HMM2 can be considered as a particular case of an HMM mixture in which the HMM emission probabilities (usually estimated through Gaussian mixtures or an artificial neural network) are modeled by state-dependent, feature-based HMM ...

2001

HMM2- Extraction of Formant Features and their Use for Robust ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

IDIAP2000

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems

Hervé Bourlard, Giulia Bernardis

In this paper we define and investigate a set of confidence measures based on hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) acoustic models. All these measures are using the neural network to estimate the local phone posterior probabilitie ...

IDIAP1998