Related publications (7)

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.

Philip Neil Garner

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or p ...
Idiap2011

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition

Philip Neil Garner

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolu ...
2011

Heat transport in model jammed solids

Matthieu Wyart, Ning Xu

We calculate numerically the normal modes of vibrations in three-dimensional jammed packings of soft spheres as a function of the packing fraction and obtain the energy diffusivity, a spectral measure of transport that controls sound propagation and therma ...
2010

Speaker Change Detection with Privacy-Preserving Audio Cues

Hervé Bourlard, Daniel Gatica-Perez, Sree Hari Krishnan Parthasarathi

In this paper we investigate a set of privacy-sensitive audio features for speaker change detection (SCD) in multiparty conversations. These features are based on three different principles: characterizing the excitation source information using linear pre ...
2009

Speaker Change Detection with Privacy-Preserving Audio Cues

Hervé Bourlard, Daniel Gatica-Perez, Sree Hari Krishnan Parthasarathi

In this paper we investigate a set of privacy-sensitive audio features for speaker change detection (SCD) in multiparty conversations. These features are based on three different principles: characterizing the excitation source information using linear pre ...
Idiap2009

Novel speech processing techniques for robust automatic speech recognition

Vivek Tyagi

The goal of this thesis is to develop and design new feature representations that can improve the automatic speech recognition (ASR) performance in clean as well noisy conditions. One of the main shortcomings of the fixed scale (typically 20-30 ms long ana ...
EPFL2006

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.