Publication

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Related publications (51)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Phase AutoCorrelation (PAC) derived Robust Speech Features

Hervé Bourlard, Hemant Misra, Shajith Ikbal

In this paper, we introduce a new class of noise robust acoustic features derived from a new measure of autocorrelation, and explicitly exploiting the phase variation of the speech signal frame over time. This family of features, referred to as ``Phase Aut ...

IDIAP2002

Increasing Speech Recognition Noise Robustness with HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber

The purpose of this paper is to investigate the behavior of HMM2 models for the recognition of noisy speech. It has previously been shown that HMM2 is able to model dynamically important structural information inherent in the speech signal, often correspon ...

2002

Robust speech recognition based on multi-stream processing

Astrid Hagen

Despite sophisticated present day automatic speech recognition (ASR) techniques, a single recognizer is usually incapable of accounting for the varying conditions in a typical natural environment. Higher robustness to a range of noise cases can potentially ...

École Polytechnique Fédérale de Lausanne2001

Robust speech recognition based on multi-stream processing

Astrid Hagen

2001

Robust Speech Recognition and Feature Extraction Using HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber, Shajith Ikbal

This paper presents the theoretical basis and preliminary experimental results of a new HMM model, referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated ...

IDIAP2001

Increasing Speech Recognition Noise Robustness with HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber

IDIAP2001

The Elisa'99 Speaker Recognition and Tracking Systems

This article presents the text-independent speaker verification and tracking systems developed by the {ELISA} consortium for the {NIST}'99 speaker recognition campaign. {ELISA} is a consortium grouping European researchers of several laboratories sharing r ...

1999

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition

The Multi-Stream automatic speech recognition approach was investigated in this work as a framework for Audio-Visual data fusion and speech recognition. This method presents many potential advantages for such a task. It particularly allows for synchronous ...

IDIAP1997

Speaker Verification in the Telephone Network : Research Activities in the CAVE Project

This paper summarizes the main results from the Speaker Verification (SV) research pursued so far in the CAVE project. Different state-of-the art SV algorithms were implemented in a common HMM framework and compared on two databases : YOHO (office environm ...

1997

Secured vocal access to telephone servers

Dominique Genoud

A number of applications of man-machine interaction over the telephone requires a combination of speech recognition and speaker verification. This paper describes current work carried out at IDIAP in the framework of national and European projects. A gener ...

IDIAP / CNRS1996