Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages

In this paper, we develop Automatic Speech Recognition (ASR) systems for multi-genre speech recognition of low-resource languages, where the training data is predominantly conversational speech but the test data can belong to any of three genres: news broadcast, topical broadcast, and conversational speech. ASR for low-resource languages is often developed by adapting a pre-trained model to the target language. When the training data is limited and predominantly from one genre, the system's performance on other genres suffers. To handle such out-of-domain scenarios, we employ multitask adaptation, using auxiliary conversational speech data from other languages in addition to the target-language data. We aim to (1) improve adaptation through implicit data augmentation by adding other languages as auxiliary tasks, and (2) prevent the acoustic model from overfitting to the dominant genre in the training set. The pre-trained parameters are obtained from a multilingual model trained on data from 18 languages using the Lattice-Free Maximum Mutual Information (LF-MMI) criterion. The adaptation is also performed with the LF-MMI criterion. We present results on MATERIAL datasets for three languages: Kazakh, Farsi, and Pashto.
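
As a rough illustration of the multitask adaptation described above, the objective can be sketched as a weighted sum of per-language MMI objectives. This is a sketch under stated assumptions, not the paper's exact formulation: the shared parameters $\theta$, the language-specific output parameters $\phi_l$, the task weights $\lambda_l$, the target language $t$, and the auxiliary language set $\mathcal{A}$ are all notation introduced here for illustration. The per-language term is the standard MMI criterion; in the lattice-free variant the denominator sum is computed over a phone-level denominator graph rather than over word lattices.

```latex
% Assumed notation (not from the abstract):
%   \theta      shared acoustic-model parameters, initialised from the multilingual model
%   \phi_l      output-layer parameters specific to language l (assumption)
%   \lambda_l   task weight for language l (assumption)
%   t           target language; \mathcal{A} set of auxiliary languages
%   \mathcal{D}_l  training utterances of language l
\begin{align}
\mathcal{F}^{(l)}_{\mathrm{MMI}}(\theta, \phi_l)
  &= \sum_{u \in \mathcal{D}_l}
     \log \frac{p(\mathbf{X}_u \mid \mathbb{M}_{w_u};\, \theta, \phi_l)\, P(w_u)}
               {\sum_{w} p(\mathbf{X}_u \mid \mathbb{M}_{w};\, \theta, \phi_l)\, P(w)} \\
\mathcal{F}_{\mathrm{multi}}(\theta, \{\phi_l\})
  &= \sum_{l \in \{t\} \cup \mathcal{A}} \lambda_l\,
     \mathcal{F}^{(l)}_{\mathrm{MMI}}(\theta, \phi_l)
\end{align}
```

Here $\mathbf{X}_u$ is the acoustic observation sequence of utterance $u$, $w_u$ is its reference transcript, and $\mathbb{M}_{w}$ is the model graph for word sequence $w$. Under this sketch, the auxiliary-language terms act as the implicit data augmentation mentioned above, since they keep updating $\theta$ with data beyond the target language's dominant genre.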
