Publication

Multi-stream Processing for Noise Robust Speech Recognition

Related publications (44)

In-Context Phone Posteriors as Complementary Features for Tandem ASR

Hervé Bourlard, Hamed Ketabdar

In this paper, we present a method for integrating possible prior knowledge (such as phonetic and lexical knowledge), as well as acoustic context (e.g., the whole utterance) in the phone posterior estimation, and we propose to use the obtained posteriors a ...

2008

A multimodal pattern recognition framework for speaker detection

Patricia Besson

Speaker detection is an important component of a speech-based user interface. Audiovisual speaker detection, speech and speaker recognition or speech synthesis for example find multiple applications in human-computer interaction, multimedia content indexin ...

EPFL, 2007

ON THE INEQUALITIES IN INFORMATION THEORY

Rethnakaran Pulikkoonattu

In 1948, Claude Elwood Shannon, then at Bell Labs, published one of the groundbreaking papers in the history of engineering [1]. This paper ("A Mathematical Theory of Communication", Bell System Tech. Journal, Vol. 27, July and October 1948, pp. 379 - ...

2007

Multi-stream Processing for Noise Robust Speech Recognition

Hemant Misra

In this thesis, the framework of multi-stream combination has been explored to improve the noise robustness of automatic speech recognition (ASR) systems. The central idea of multi-stream ASR is to combine information from several sources to improve the pe ...

École Polytechnique Fédérale de Lausanne, 2006

Multi-stream processing for noise robust speech recognition

Hemant Misra

EPFL, 2006

Spectral Entropy Feature in Multi-stream for Robust ASR

Hervé Bourlard, Hemant Misra

In recent papers, entropy computed from sub-bands of the spectrum was used as a feature for automatic speech recognition. In the present paper, we further study the sub-band spectral entropy features which can give the flatness/peakiness of the sub-band sp ...

IDIAP, 2005

Multi-resolution Spectral Entropy Based Feature for Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

Recently, entropy measures at different stages of recognition have been used in automatic speech recognition (ASR) tasks. In a recent paper, we proposed that formant positions of a spectrum can be captured by a multi-resolution spectral entropy feature. In th ...

2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial results towards boosting posterior based speech recognition systems by estimating more informative posteriors using multiple streams of features and taking into account acoustic context (e.g., as available in the whole utt ...

2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

IDIAP, 2005

Multi-stream ASR: Oracle Test and Embedded Training

Hervé Bourlard, Hemant Misra

Multi-stream based automatic speech recognition (ASR) systems outperform their single stream counterparts, especially in the case of noisy speech. However, the main issues in multi-stream systems are to know a) Which streams to be combined, and b) How to c ...

IDIAP, 2005