Publication

Non-linear Spectral Contrast Stretching for In-car Speech Recognition

Related publications (35)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Correcting Confusion Matrices for Phone Recognizers

Modern speech recognition has many ways of quantifying the misrecognitions a speech recognizer makes. The errors in modern speech recognition makes extensive use of the Levenshtein algorithm to find the distance between the labeled target and the recognize ...

IDIAP2007

Novel speech processing techniques for robust automatic speech recognition

Vivek Tyagi

The goal of this thesis is to develop and design new feature representations that can improve the automatic speech recognition (ASR) performance in clean as well noisy conditions. One of the main shortcomings of the fixed scale (typically 20-30 ms long ana ...

EPFL2006

Efficient integration of automated speech recognition in the framework of dialogue-based vocal systems

In this work, we propose different strategies for efficiently integrating an automated speech recognition module in the framework of a dialogue-based vocal system. The aim is the study of different ways leading to the improvement of the quality and robustn ...

EPFL2005

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

In this paper, we introduce a new noise robust representation of speech signal obtained by locating points of potential importance in the spectrogram, and parameterizing the activity of time-frequency pattern around those points. These features are referre ...

2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

IDIAP2004

Robust Speech Recognition and Feature Extraction Using HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber, Shajith Ikbal

This paper presents the theoretical basis and preliminary experimental results of a new HMM model, referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated ...

2003

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

IDIAP2003

Increasing Speech Recognition Noise Robustness with HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber

The purpose of this paper is to investigate the behavior of HMM2 models for the recognition of noisy speech. It has previously been shown that HMM2 is able to model dynamically important structural information inherent in the speech signal, often correspon ...

2002