This document describes a neural method for clustering words and its use in language modeling for speech recognizers. The method is based on clustering words that appear in similar local contexts and estimating the parameters needed for language modeling ...
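A minimal sketch of the general idea, not the paper's neural method: words are represented by counts of their left and right neighbours and grouped with k-means, one simple way to cluster words that share local contexts before estimating class-based language-model parameters. The toy corpus and the choice of three clusters are arbitrary.

```python
# Toy context-clustering sketch (NOT the paper's neural method): words are
# described by counts of their left and right neighbours and grouped with
# k-means; words sharing local contexts should land in the same class.
import numpy as np
from sklearn.cluster import KMeans

corpus = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}

# Local-context vectors: left-neighbour counts followed by right-neighbour counts.
ctx = np.zeros((len(vocab), 2 * len(vocab)))
for i, w in enumerate(corpus):
    if i > 0:
        ctx[idx[w], idx[corpus[i - 1]]] += 1
    if i + 1 < len(corpus):
        ctx[idx[w], len(vocab) + idx[corpus[i + 1]]] += 1

classes = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(ctx)
word2class = {w: int(c) for w, c in zip(vocab, classes)}
print(word2class)  # words with similar neighbours end up in the same class
```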
The challenge of automatic speech recognition (ASR) increases when speaker variability is encountered. Being able to automatically use different acoustic models according to speaker type might help increase the robustness of ASR. We present a system that ...
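As a rough illustration of the idea only (not the presented system), the sketch below routes an utterance to one of several speaker-type-specific acoustic models after a crude speaker-type decision; classify_speaker_type and the per-type "models" are placeholders, not trained components.

```python
# Rough sketch: dispatch an utterance to a speaker-type-specific acoustic model.
# The classifier and the per-type decoders below are placeholders.
import numpy as np

def classify_speaker_type(features: np.ndarray) -> str:
    """Toy detector: threshold a pitch-like feature to guess the speaker type."""
    return "female" if features.mean() > 180.0 else "male"

acoustic_models = {                      # hypothetical per-type decoders
    "male": lambda feats: "hypothesis from male-adapted model",
    "female": lambda feats: "hypothesis from female-adapted model",
}

def recognize(features: np.ndarray) -> str:
    speaker_type = classify_speaker_type(features)
    return acoustic_models[speaker_type](features)

print(recognize(np.array([210.0, 195.0, 205.0])))  # routed to the female model
```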
Articulatory representations are expected to bring better speech recognition results. This requires estimating the parameters of a speech production model from the speech sound, a problem known as acoustico-articulatory inversion. Known methods to solve this ...
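One common family of solutions to such inversion problems is analysis-by-synthesis: search for production-model parameters whose synthesized spectrum matches the observed one. The sketch below illustrates that loop with a toy two-resonance forward model and a generic optimizer; it is not the method the abstract refers to, and real inversion is ill-posed (several parameter settings can produce the same spectrum).

```python
# Toy analysis-by-synthesis inversion: optimize "articulatory" parameters so the
# synthesized spectrum matches an observed spectrum. The forward model is a
# stand-in, not a real articulatory synthesizer.
import numpy as np
from scipy.optimize import minimize

freqs = np.linspace(0.0, 4000.0, 64)

def forward_model(params: np.ndarray) -> np.ndarray:
    """Two resonance peaks whose centre frequencies are the 'articulatory' parameters."""
    f1, f2 = params
    return np.exp(-((freqs - f1) / 200.0) ** 2) + np.exp(-((freqs - f2) / 300.0) ** 2)

observed = forward_model(np.array([500.0, 1500.0]))  # pretend this came from speech

def spectral_distance(params: np.ndarray) -> float:
    return float(np.sum((forward_model(params) - observed) ** 2))

result = minimize(spectral_distance, x0=np.array([700.0, 1200.0]), method="Nelder-Mead")
print("recovered parameters:", result.x)
```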
In this report, we discuss the initial issues addressed in a research project aiming at the development of an advanced natural speech recognition system for the automatic processing of telephone directory requests. This multi-faceted project involves (1) t ...
We describe a method for tracking the tongue, lips, and throat in X-ray films showing the side view of the vocal tract. The technique uses specialized histogram normalization and a new tracking method that is robust against occlusion, noise, and spo ...
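Histogram normalization in this setting remaps grey levels so that low-contrast X-ray frames become comparable across the film. The sketch below shows plain histogram equalization as a stand-in; the paper's specialized variants are not reproduced here.

```python
# Minimal histogram-equalization sketch for a low-contrast frame (generic
# technique, not the paper's specialized normalization).
import numpy as np

def equalize_histogram(frame: np.ndarray) -> np.ndarray:
    """Map grey levels through the image's own cumulative histogram."""
    hist, _ = np.histogram(frame.ravel(), bins=256, range=(0, 256))
    cdf = hist.cumsum().astype(np.float64)
    cdf = (cdf - cdf.min()) / (cdf.max() - cdf.min())   # normalize to [0, 1]
    lut = np.round(cdf * 255).astype(np.uint8)          # grey-level lookup table
    return lut[frame]

frame = np.random.randint(80, 120, size=(64, 64), dtype=np.uint8)  # dull synthetic frame
out = equalize_histogram(frame)
print(out.min(), out.max())  # contrast stretched towards the full 0..255 range
```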
The paper presents the European ACTS project “M2VTS”, which stands for Multi Modal Verification for Teleservices and Security Applications. The primary goal of this project is to address the issue of secured access to local and centralised services in a mul ...
Multi-band ASR was largely inspired by the extremely high level of redundancy in the spectral signal representation, which can be inferred from Fletcher's product-of-errors rule for human speech perception. Indeed, the main aim of the multi-band approach is ...
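Fletcher's product-of-errors rule models the full-band recognition error as the product of independent per-band errors, which is what motivates processing frequency bands separately: one reliable band can keep the overall error low even when the others are noisy. A small numeric illustration with made-up error rates:

```python
# Numeric illustration of the product-of-errors rule (error rates are invented).
band_errors = [0.30, 0.25, 0.05, 0.40]   # hypothetical per-band error rates

full_band_error = 1.0
for e in band_errors:
    full_band_error *= e

print(f"predicted full-band error: {full_band_error:.4f}")   # 0.0015
```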
Current technology for automatic speech recognition (ASR) uses hidden Markov models (HMMs) that recognize speech using the acoustic signal. However, no use is made of the causes of the acoustic signal: the articulators. We present here a dynamic Bayesian ...
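The structural point can be illustrated with a toy dynamic Bayesian network in which the acoustic observation depends on a hidden articulatory variable that itself depends on the phone state; the probabilities below are invented and inference is brute-force enumeration, purely to show the extra hidden layer an HMM lacks. This is not the paper's model.

```python
# Toy DBN with an articulatory layer between phone states and observations.
# Likelihood of a short observation sequence by exact enumeration.
from itertools import product

phones = ["p1", "p2"]
articulators = ["open", "closed"]

p_phone_init = {"p1": 0.6, "p2": 0.4}
p_phone_trans = {("p1", "p1"): 0.7, ("p1", "p2"): 0.3,
                 ("p2", "p1"): 0.2, ("p2", "p2"): 0.8}
p_artic_given_phone = {("open", "p1"): 0.9, ("closed", "p1"): 0.1,
                       ("open", "p2"): 0.2, ("closed", "p2"): 0.8}
p_obs_given_artic = {("o1", "open"): 0.8, ("o2", "open"): 0.2,
                     ("o1", "closed"): 0.3, ("o2", "closed"): 0.7}

def sequence_likelihood(obs_seq):
    """Sum over all hidden phone and articulator chains (exact, exponential)."""
    total = 0.0
    T = len(obs_seq)
    for phone_seq in product(phones, repeat=T):
        for artic_seq in product(articulators, repeat=T):
            p = p_phone_init[phone_seq[0]]
            for t in range(1, T):
                p *= p_phone_trans[(phone_seq[t - 1], phone_seq[t])]
            for t in range(T):
                p *= p_artic_given_phone[(artic_seq[t], phone_seq[t])]
                p *= p_obs_given_artic[(obs_seq[t], artic_seq[t])]
            total += p
    return total

print(sequence_likelihood(["o1", "o2"]))
```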
This paper proposes a method for recovering the articulatory parameters of a factor-based vocal tract shape model from the speech waveform. This is realized by analytically relating the shape model to a Linear Prediction lattice filter. Results pertaining ...
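The analytic link between a lattice filter and a tube-shaped vocal tract is classically expressed through reflection coefficients at tube junctions. The sketch below uses the standard lossless-tube relation only to illustrate that kind of link; it is not the factor-based shape model of the paper, and sign conventions for the reflection coefficients vary between texts.

```python
# Classical lossless-tube relation: junction reflection coefficient
# r_k = (A_{k+1} - A_k) / (A_{k+1} + A_k), hence A_{k+1} = A_k (1 + r_k) / (1 - r_k).
import numpy as np

def areas_from_reflection_coeffs(refl, lips_area=1.0):
    """Tube section areas recovered from junction reflection coefficients."""
    areas = [lips_area]
    for r in refl:
        areas.append(areas[-1] * (1.0 + r) / (1.0 - r))
    return np.array(areas)

refl = np.array([0.2, -0.1, 0.4, -0.3])   # hypothetical lattice coefficients
print(areas_from_reflection_coeffs(refl))
```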