Publication

Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification

Related publications (40)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Unified Framework Of Feature Based Adaptation For Statistical Speech Synthesis And Recognition

Lakshmi Babu Saheer

The advent of statistical parametric speech synthesis has paved new ways to a unified framework for hidden Markov model (HMM) based text to speech synthesis (TTS) and automatic speech recognition (ASR). The techniques and advancements made in the field of ...

Ecole Polytechnique Federale de Lausanne (EPFL)2012

Multi-parametric source-filter separation of speech and prosodic voice restoration

Olaf Schleusing

In this thesis, methods and models are developed and presented aiming at the estimation, restoration and transformation of the characteristics of human speech. During a first period of the thesis, a concept was developed that allows restoring prosodic voic ...

EPFL2012

Synthetic References for Template-based ASR using Posterior Features

Hervé Bourlard, Serena Soldo

Recently, the use of phoneme class-conditional probabilities as features (posterior features) for template-based ASR has been proposed. These features have been found to generalize well to unseen data and yield better systems than standard spectral-based f ...

2012

Enhanced Phone Posteriors for Improving Speech Recognition Systems

Hervé Bourlard, Hamed Ketabdar

Using phone posterior probabilities has been increasingly explored for improving automatic speech recognition (ASR) systems. In this paper, we propose two approaches for hierarchically enhancing these phone posteriors, by integrating long acoustic context, ...

2010

AMIDA/Klewel Mini-Project

Petr Motlicek, Philip Neil Garner, Vincent Bozzo

The goal of the AMIDA mini-project is to transfer some of the technologies developed within the AMIDA project to be used by a Klewel retrieval system. More specifically, the main focus is to develop a speech-to-text application based on the AMIDA Automatic ...

Idiap2010

Measuring the gap between HMM-based ASR and TTS

John David Scott Dines, Simon King

The EMIME European project is conducting research in the development of technologies for mobile, personalised speech-tospeech translation systems. The hidden Markov model is being used as the underlying technology in both automatic speech recognition (ASR) ...

2009

Measuring the gap between HMM-based ASR and TTS

John David Scott Dines, Simon King

Idiap2009

How does a dictation machine recognize speech?

Hervé Bourlard

There is magic (or is it witchcraft?) in a speech recognizer that transcribes continuous radio speech into text with a word accuracy of even not more than 50%. The extreme difficulty of this task, tough, is usually not perceived by the general public. This ...

Idiap2008

A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems

Samy Bengio

This paper present a principled SVM based speaker verification system. We propose a new framework and a new sequence kernel that can make use of any Mercer kernel at the frame level. An extension of the sequence kernel based on the Max operator is also pro ...

2007

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

2006