Template-based ASR using Posterior features and synthetic references: comparing different TTS systems

Recent work has shown that using phone class-conditional posterior probabilities (posterior features) directly as features yields good results in template-based ASR systems. In this paper, motivated by the high quality of current text-to-speech systems and the robustness of posterior features to undesired variability, we investigate the use of synthetic speech to generate reference templates. Using synthetic speech in template-based ASR addresses not only the issue of in-domain data collection but also that of vocabulary expansion. On 75- and 600-word task-independent and speaker-independent setups of the Phonebook corpus, we show the feasibility of this approach by investigating different synthetic voices produced by an HTS-based synthesizer trained on two different databases. Our study shows that synthetic speech templates can yield performance comparable to natural speech templates, especially with synthetic voices that have high intelligibility.
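To make the idea of matching posterior-feature templates concrete, here is a minimal sketch. The abstract does not state the matching algorithm or the local distance, so this sketch assumes dynamic time warping (DTW) with a Kullback-Leibler divergence between posterior vectors; the function names, the toy three-class posteriors, and the two-word vocabulary are all hypothetical and only illustrate the general technique.

```python
import math

def kl_divergence(p, q, eps=1e-10):
    """KL(p || q) between two posterior vectors (assumed local distance)."""
    return sum(pi * math.log((pi + eps) / (qi + eps))
               for pi, qi in zip(p, q) if pi > 0)

def dtw_cost(template, test):
    """Length-normalised DTW cost between two sequences of posterior vectors."""
    n, m = len(template), len(test)
    inf = float("inf")
    # acc[i][j] = best accumulated cost aligning template[:i] with test[:j]
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = kl_divergence(template[i - 1], test[j - 1])
            acc[i][j] = local + min(acc[i - 1][j],      # template frame advances
                                    acc[i][j - 1],      # test frame advances
                                    acc[i - 1][j - 1])  # both advance
    return acc[n][m] / (n + m)

def recognize(templates, test):
    """Return the word whose reference template has the lowest DTW cost."""
    return min(templates, key=lambda word: dtw_cost(templates[word], test))

# Toy example: posteriors over 3 hypothetical phone classes.
# The reference templates could come from natural or synthetic speech.
templates = {
    "ab": [[0.90, 0.05, 0.05], [0.05, 0.90, 0.05]],
    "ac": [[0.90, 0.05, 0.05], [0.05, 0.05, 0.90]],
}
test = [[0.8, 0.1, 0.1], [0.8, 0.1, 0.1], [0.1, 0.1, 0.8]]
print(recognize(templates, test))
```

In this toy setup the test utterance ends in frames dominated by the third phone class, so the template for "ac" gives the lower cost. Under this scheme, adding a word to the vocabulary only requires a new reference template, which is where synthetic speech would come in.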

