Related publications (7)

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.

Philip Neil Garner

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or p ...
Idiap, 2011

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition

Philip Neil Garner

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolu ...
2011

Heat transport in model jammed solids

Matthieu Wyart, Ning Xu

We calculate numerically the normal modes of vibrations in three-dimensional jammed packings of soft spheres as a function of the packing fraction and obtain the energy diffusivity, a spectral measure of transport that controls sound propagation and therma ...
2010

Speaker Change Detection with Privacy-Preserving Audio Cues

Hervé Bourlard, Daniel Gatica-Perez, Sree Hari Krishnan Parthasarathi

In this paper we investigate a set of privacy-sensitive audio features for speaker change detection (SCD) in multiparty conversations. These features are based on three different principles: characterizing the excitation source information using linear pre ...
2009

Speaker Change Detection with Privacy-Preserving Audio Cues

Hervé Bourlard, Daniel Gatica-Perez, Sree Hari Krishnan Parthasarathi

In this paper we investigate a set of privacy-sensitive audio features for speaker change detection (SCD) in multiparty conversations. These features are based on three different principles: characterizing the excitation source information using linear pre ...
Idiap, 2009

Novel speech processing techniques for robust automatic speech recognition

Vivek Tyagi

The goal of this thesis is to develop and design new feature representations that can improve automatic speech recognition (ASR) performance in clean as well as noisy conditions. One of the main shortcomings of the fixed scale (typically 20-30 ms long ana ...
EPFL, 2006

On Multi-scale Fourier Transform Analysis of Speech Signals

Hervé Bourlard, Vivek Tyagi

In this paper, we introduce a novel algorithm to perform multi-scale Fourier transform analysis of piecewise stationary signals with application to automatic speech recognition. Such signals are composed of quasi-stationary segments of variable lengths. Th ...
IDIAP, 2003
