Concept

Speech

Related publications (224)

Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition

\begin{abstract} We present a new filter bank design method for subband adaptive beamforming. Filter bank design for adaptive filtering poses many problems not encountered in more traditional applications such as subband coding of speech or music. The popu ...

IDIAP2008

Adaptive Beamforming with a Maximum Negentropy Criterion

Philip Neil Garner, Weifeng Li

\begin{abstract} In this paper, we address an adaptive beamforming application in realistic acoustic conditions. After the position of a speaker is estimated by a speaker tracking system, we construct a subband-domain beamformer in \emph{generalized sidelo ...

IDIAP2008

Adaptive Beamforming with a Maximum Negentropy Criterion

Philip Neil Garner, Weifeng Li

In this paper, we address an adaptive beamforming application in realistic acoustic conditions. After the position of a speaker is estimated by a speaker tracking system, we construct a subband-domain beamformer in generalized sidelobe canceller (GSC) conf ...

2008

Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence

Hynek Hermansky, Fabio Valente

In this paper we investigate combination of neural net based classifiers using Dempster-Shafer Theory of Evidence. Under some assumptions, combination rule resembles a product of errors rule observed in human speech perception. Different combination are te ...

2007

Perception Studies on the Attributes of Synthetic Clear Speech for the Hard of Hearing

Chandra Sekhar Seelamantula

We make a case for ‘synthetic clear speech’ in the context of the persons with hearing impairment. We study the acoustic attributes of ‘clear speech’ that enable us to understand their importance in speech perception. Our perception experiments are motivat ...

IEEE2007

Correcting Confusion Matrices for Phone Recognizers

Modern speech recognition has many ways of quantifying the misrecognitions a speech recognizer makes. The errors in modern speech recognition makes extensive use of the Levenshtein algorithm to find the distance between the labeled target and the recognize ...

IDIAP2007

Low-Dimensional Motion Features for Audio-Visual Speech Recognition

Jean-Philippe Thiran, Mihai Gurban, Andrés Vallés

Audio-visual speech recognition promises to improve the performance of speech recognizers, especially when the audio is corrupted, by adding information from the visual modality, more specifically, from the video of the speaker. However, the number of visu ...

2007

Posterior-Based Features and Distances in Template Matching for Speech Recognition

Hervé Bourlard, Guillermo Aradilla

The use of large speech corpora in example-based approaches for speech recognition is mainly focused on increasing the number of examples. This strategy presents some difficulties because databases may not provide enough examples for some rare words. In th ...

2007

Posterior-Based Features and Distances in Template Matching for Speech Recognition

Hervé Bourlard, Guillermo Aradilla

IDIAP2007

Effective post-processing for single-channel frequency-domain speech enhancement

Weifeng Li

Conventional frequency-domain speech enhancement filters improve signal-to-noise ratio (SNR), but also produce speech distortions. This paper describes a novel post-processing algorithm devised for the improvement of the quality of the speech processed by ...

IDIAP2007

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.