Publication

Dysarthric Speech Recognition with Lattice-Free MMI

Publications associées (32)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Multi-parametric source-filter separation of speech and prosodic voice restoration

Olaf Schleusing

In this thesis, methods and models are developed and presented aiming at the estimation, restoration and transformation of the characteristics of human speech. During a first period of the thesis, a concept was developed that allows restoring prosodic voic ...

EPFL2012

Improving non-native ASR through stochastic multilingual phoneme space transformations

Hervé Bourlard, Philip Neil Garner, John David Scott Dines, David Imseng

We propose a stochastic phoneme space transformation technique that allows the conversion of conditional source phoneme posterior probabilities (conditioned on the acoustics) into target phoneme posterior probabilities. The source and target phonemes can b ...

2011

Improving non-native ASR through stochastic multilingual phoneme space transformations

Hervé Bourlard, Philip Neil Garner, John David Scott Dines, David Imseng

Idiap2011

Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech

Hui Liang

This paper describes speaker discrimination experiments in which native English listeners were presented with natural speech stimuli in English and Mandarin, synthetic speech stimuli in English and Mandarin, or natural Mandarin speech and synthetic English ...

2011

Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech

Hui Liang

Idiap2011

Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods

Samy Bengio

This is the first book dedicated to uniting research related to speech and speaker recognition based on the recent advances in large margin and kernel methods. The first part of the book presents theoretical and practical foundations of large margin and ke ...

John Wiley & Sons2008

The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments

The recognition of speech in meetings poses a number of challenges to current Automatic Speech Recognition (ASR) techniques. Meetings typically take place in rooms with non-ideal acoustic conditions and significant background noise, and may contain large s ...

IDIAP2005

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

École Polytechnique Fédérale de Lausanne2004