Publication

Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis

Publications associées (38)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

2006

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

IDIAP2005

Some Emerging Concepts in Speech Recognition.

Hervé Bourlard, Hynek Hermansky

The paper presents a work-in-progress on several emerging concepts in Automatic Speech Recognition (ASR), that are being currently studied at IDIAP. This work can be roughly categorized into three categories: 1) data-guided features, 2) features based on m ...

IDIAP2003

Evaluation of Formant-Like Features for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This paper investigates possibilities to automatically find a low-dimensional, formant-related physical representation of the speech signal, which is suitable for automatic speech recognition (ASR). This aim is motivated by the fact that formants have been ...

IDIAP2002

Evaluation of Formant-Like Features for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

2002

A Pragmatic View of the Application of HMM2 for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This report investigates the HMM2 approach recently introduced in the framework of automatic speech recognition. HMM2 can be seen as a mixture of HMMs, where a conventional primary HMM (processing a time series of speech data) is supported on a lower level ...

IDIAP2001

Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks

Hervé Bourlard

The challenge of automatic speech recognition (ASR) increases when speaker variability is encountered. Being able to automatically use different acoustic models according to speaker type might help to increase the robustness of ASR. We present a system tha ...

IDIAP2000

Statistical lip modelling for visual speech recognition

We describe a speechreading (lipreading) system purely based on visual features extracted from grey level image sequences of the speakers lips. Active shape models are used to track the lip contours while visual speech information is extracted from the sha ...

1996