Publication

How does a dictation machine recognize speech?

Related publications (47)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

IDIAP2003

Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable

In current automatic speech recognition (ASR) systems, the energy is not used as part of the feature vector in spite of being a fundamental feature in the speech signal. The noise inherent in its estimation degrades the system performance. In this report w ...

IDIAP2003

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework

Hervé Bourlard, Jitendra Ajmera

In this paper, we present a new approach towards high performance speech/music discrimination on realistic tasks related to the automatic transcription of broadcast news. In the approach presented here, the (local) Probability Density Function (PDF) estima ...

2003

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

IDIAP2003

A Pragmatic View of the Application of HMM2 for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This report investigates the HMM2 approach recently introduced in the framework of automatic speech recognition. HMM2 can be seen as a mixture of HMMs, where a conventional primary HMM (processing a time series of speech data) is supported on a lower level ...

IDIAP2001

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framewor

Hervé Bourlard, Jitendra Ajmera

IDIAP2001

EPFL lab session 2/2: Introduction to Hidden Markov Models

Sacha Krstulovic

Lab sessions given in relation to Herve Bourlard's Speech Recognition course at EPFL (Ecole Polytechnique Federale de Lausanne), second semester 2001. The full session is available from the web as ftp://ftp.idiap.ch/pub/sacha/labs/Session2.tgz . ...

IDIAP2001

Speech Recognition Using Advanced HMM2 Features

Hervé Bourlard, Samy Bengio, Katrin Weber

HMM2 is a particular hidden Markov model where state emission probabilities of the temporal (primary) HMM are modeled through (secondary) state-dependent frequency-based HMMs [12]. As shown in [13], a secondary HMM can also be used to extract robust ASR fe ...

IDIAP2001

Speech Recognition Using Advanced HMM2 Features

Hervé Bourlard, Samy Bengio, Katrin Weber

2001