Current trends in multilingual speech processing

Hervé Bourlard, Petr Motlicek, Philip Neil Garner, John David Scott Dines, Fabio Valente, David Imseng, Hui Liang, Lakshmi Babu Saheer
2011
Journal paper

Abstract

In this paper, we describe recent work at Idiap Research Institute in the domain of multilingual speech processing and provide some insights into emerging challenges for the research community. Multilingual speech processing has been a topic of ongoing interest to the research community for many years and the field is now receiving renewed interest owing to two strong driving forces. Firstly, technical advances in speech recognition and synthesis are posing new challenges and opportunities to researchers. For example, discriminative features are seeing wide application by the speech recognition community, but additional issues arise when using such features in a multilingual setting. Another example is the apparent convergence of speech recognition and speech synthesis technologies in the form of statistical parametric methodologies. This convergence enables the investigation of new approaches to unified modelling for automatic speech recognition and text-to-speech synthesis (TTS) as well as cross-lingual speaker adaptation for TTS. The second driving force is the impetus being provided by both government and industry for technologies to help break down domestic and international language barriers, these also being barriers to the expansion of policy and commerce. Speech-to-speech and speech-to-text translation are thus emerging as key technologies at the heart of which lies multilingual speech processing.

Official source

https://infoscience.epfl.ch/record/178603?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Current trends in multilingual speech processing

Graph Chatbot

Chat with Graph Search

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models

Novel Methods For Detection And Analysis Of Atypical Aspects In Speech

Sparse Autoencoders for Speech Modeling and Recognition

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models

Novel Methods For Detection And Analysis Of Atypical Aspects In Speech

Sparse Autoencoders for Speech Modeling and Recognition