Publication

Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Related publications (45)

Application of Out-Of-Language Detection To Spoken-Term Detection

Petr Motlicek, Fabio Valente

This paper investigates the detection of English spoken terms in a conversational multi-language scenario. The speech is processed using a large vocabulary continuous speech recognition system. The recognition output is represented in the form of word reco ...

Idiap, 2010

VTLN Adaptation for Statistical Speech Synthesis

Philip Neil Garner, John David Scott Dines, Hui Liang, Lakshmi Babu Saheer

The advent of statistical speech synthesis has enabled the unification of the basic techniques used in speech synthesis and recognition. Adaptation techniques that have been successfully used in recognition systems can now be applied to synthesis systems t ...

2010

Towards mixed language speech recognition systems

Hervé Bourlard, David Imseng

Multilingual speech recognition obviously involves numerous research challenges, including common phoneme sets, adaptation on limited amounts of training data, as well as mixed language recognition (common in many countries, like Switzerland). In this latte ...

2010

Towards mixed language speech recognition systems

Hervé Bourlard, David Imseng

Idiap, 2010

VTLN Adaptation for Statistical Speech Synthesis

Philip Neil Garner, John David Scott Dines, Hui Liang, Lakshmi Babu Saheer

Idiap, 2009

Robust Speaker Diarization for Short Speech Recordings

David Imseng

We investigate a state-of-the-art Speaker Diarization system regarding its behavior on meetings that are much shorter (from 500 seconds down to 100 seconds) than those typically analyzed in Speaker Diarization benchmarks. First, the problems inherent to th ...

2009

Robust Speaker Diarization for Short Speech Recordings

David Imseng

Idiap, 2009

Robust Pedestrian Navigation for Challenging Applications

Pierre-Yves Gilliéron

Presentation of a concept for robust indoor navigation. The concept is based on three key elements: - the use of an absolute geographical reference - the hybridisation of complementary technologies - specific motion models. This concept is illustrated by t ...

2009

How does a dictation machine recognize speech?

Hervé Bourlard

There is magic (or is it witchcraft?) in a speech recognizer that transcribes continuous radio speech into text with a word accuracy of even not more than 50%. The extreme difficulty of this task, though, is usually not perceived by the general public. This ...

Idiap, 2008

Machine Learning for Multimodal Interaction IV

Hervé Bourlard, Andrei Popescu-Belis, Steve Renals

This book constitutes the thoroughly refereed post-proceedings of the 4th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2007, held in Brno, Czech Republic, in June 2007. The 25 revised full papers presented together with 1 inv ...

Springer-Verlag, 2008