Publication

Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control

Related publications (42)

Towards using slide information to enhance speech transcription of meetings

Hervé Bourlard, Artem Peregoudov, Alessandro Vinciarelli

In this paper we investigate the possibility of improving the speech recognition performance of meeting recordings by using slides captured during the recording process. The key hypothesis exploited in this work is that both slides and speech carry correla ...

IDIAP, 2006

Efficient integration of automated speech recognition in the framework of dialogue-based vocal systems

In this work, we propose different strategies for efficiently integrating an automated speech recognition module in the framework of a dialogue-based vocal system. The aim is the study of different ways leading to the improvement of the quality and robustn ...

EPFL, 2005

Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Guillaume Lathoud

Speech-based command interfaces are becoming more and more common in cars. Applications include automatic dialog systems for hands-free phone calls as well as more advanced features such as navigation systems. However, interferences, such as speech from th ...

IDIAP, 2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud

Microphone arrays are useful in meeting rooms, where speech needs to be acquired and segmented. For example, automatic speech segmentation allows enhanced browsing experience, and facilitates automatic analysis of large amounts of data. Spontaneous multi-p ...

IDIAP, 2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud

2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents an overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

IDIAP, 2003

Some Emerging Concepts in Speech Recognition.

Hervé Bourlard, Hynek Hermansky

The paper presents work in progress on several emerging concepts in Automatic Speech Recognition (ASR) that are currently being studied at IDIAP. This work can be roughly divided into three categories: 1) data-guided features, 2) features based on m ...

IDIAP, 2003

A Pragmatic View of the Application of HMM2 for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This report investigates the HMM2 approach recently introduced in the framework of automatic speech recognition. HMM2 can be seen as a mixture of HMMs, where a conventional primary HMM (processing a time series of speech data) is supported on a lower level ...

IDIAP, 2001

Speech Recognition Engine for Interactive Voice Response application on Windows

This paper is a report for the Postgraduate course Language and Speech Engineering. The report describes part of the work on the InfoVOX project, whose goal is to implement a Speech Recognition Engine (SRE) on Windows with state-of-the-art SR technologies, and integr ...

IDIAP, 2001