Publication

VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis

Related publications (41)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Characterization of natural sand under complex dynamic loading

Laurent Vulliet, Emilie Rascol

An experimental study, conducted with an advanced triaxial press, evaluates the effect of different loading paths on the dynamic behavior of natural Swiss sand. Emphasis is put on comparison between the dynamic parameters evaluated in a single loading (wit ...

Taylor & Francis Group2009

Modulation Frequency Features For Phoneme Recognition In Noisy Speech

Hynek Hermansky, Sriram Ganapathy, Samuel Thomas

In this letter, a new feature extraction technique based on modulation spectrum derived from syllable-length segments of sub-band temporal envelopes is proposed. These sub-band envelopes are derived from auto-regressive modelling of Hilbert envelopes of th ...

2008

Modulation Frequency Features For Phoneme Recognition In Noisy Speech

Hynek Hermansky, Sriram Ganapathy, Samuel Thomas

In this paper, a new feature extraction technique based on modulation spectrum derived from syllable-length segments of sub-band temporal envelopes is proposed. These sub-band envelopes are derived from auto-regressive modelling of Hilbert envelopes of the ...

Idiap2008

Robust overlapping speech recognition based on neural networks

John David Scott Dines, Weifeng Li

We address issues for improving hands-free speech recognition performance in the presence of multiple simultaneous speakers using multiple distant microphones. In this paper, a log spectral mapping is proposed to estimate the log mel-filterbank outputs of ...

IDIAP2007

Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus

Hervé Bourlard

In this paper, we present a robust speech acquisition system to acquire continuous speech using a microphone array. A microphone array based speech recognition system is also presented to study the environmental interference due to reverberation, backgroun ...

IDIAP2005

Using auxiliary sources of knowledge for automatic speech recognition

Mathew Magimai Doss

Standard hidden Markov model (HMM) based automatic speech recognition (ASR) systems usually use cepstral features as acoustic observation and phonemes as subword units. Speech signal exhibits wide range of variability such as, due to environmental variatio ...

EPFL2005

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition

École Polytechnique Fédérale de Lausanne, Computer Science Department2005

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition

IDIAP2005

LPC-based inversion of the DRM articulatory model

Sacha Krstulovic

Articulatory representations are expected to bring better speech recognition results. This requires to estimate the parameters of a speech production model from the speech sound, problem known as acoustico-articulatory inversion. Known methods to solve thi ...

1999

Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering

Sacha Krstulovic

Constraints related to the Distinctive Regions and Modes (DRM) speech production model are incorporated in the framework of speech analysis by inverse filtering. It is shown that the analogy between Auto-Regressive modeling and acoustic models based on aco ...

IDIAP1998