Related publications (38)

Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages

Hervé Bourlard, Petr Motlicek

In this paper, we develop Automatic Speech Recognition (ASR) systems for multi-genre speech recognition of low-resource languages where training data is predominantly conversational speech but test data can be in one of the following genres: news broadcast ...
ISCA-INT SPEECH COMMUNICATION ASSOC2021

Explainable Phonology-based Approach for Sign Language Recognition and Assessment

Sandrine Tornay

Sign language technology, unlike spoken language technology, is an emerging area of research. Sign language technologies can help in bridging the gap between the Deaf community and the hearing community. One such computer-aided technology is sign language ...
EPFL2021

Lightweight Cross-Lingual Sentence Representation Learning

Martin Jaggi, Prakhar Gupta, Zhuoyuan Mao

Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks. However, further increases and modifications based on such ...
ASSOC COMPUTATIONAL LINGUISTICS-ACL2021

Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages

Enno Hermann

Subword modeling for zero-resource languages aims to learn low-level representations of speech audio without using transcriptions or other resources from the target language (such as text corpora or pronunciation dictionaries). A good representation should ...
2020

Neural Network Based End-to-End Query by Example Spoken Term Detection

Hervé Bourlard, Dhananjay Ram

This article focuses on the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or bottlene ...
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC2020

Language Independent Query by Example Spoken Term Detection

Dhananjay Ram

Language independent query-by-example spoken term detection (QbE-STD) is the problem of retrieving audio documents from an archive, which contain a spoken query provided by a user. This is usually casted as a hypothesis testing and pattern matching problem ...
EPFL2019

Cross-lingual Adaptation of a CTC-based multilingual Acoustic Model

Hervé Bourlard, Philip Neil Garner, Sibo Tong

Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to benefit from more training data, and better lend themselves to adaptation to under-resourced languages. However, initialisation from monolingual context-de ...
ELSEVIER SCIENCE BV2018

Fast Language Adaptation Using Phonological Information

Hervé Bourlard, Philip Neil Garner, Sibo Tong

Phoneme-based multilingual connectionist temporal classification (CTC) model is easily extensible to a new language by concatenating parameters of the new phonemes to the output layer. In the present paper, we improve cross-lingual adaptation in the contex ...
ISCA-INT SPEECH COMMUNICATION ASSOC2018

Building the Moroccan Darija WordNet (MDW) using Bilingual Resources

Khalil Mrini

Moroccan Darija is one of the Arabic dialects, a continuum of under-resourced vernaculars. We develop a Moroccan Darija Wordnet (MDW) using a bilingual Moroccan-English dictionary, from which we collect nearly 13,000 definitions and over 15,000 lemmas. A M ...
2017

Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models

Andrei Popescu-Belis, Nikolaos Pappas, Khalil Mrini

Cross-lingual transfer has been shown to increase the performance of a text classification model thanks to the use of Multilingual Hierarchical Attention Networks (MHAN), on which this work is based. Firstly, we compared the performance of monolingual and ...
Idiap2017

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.