Publication

Multi-parametric source-filter separation of speech and prosodic voice restoration

Publications associées (211)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Automatic Speech Receognition for Human-Machine Interaction

Pierre-André Farine, Michael Ansorge, Sara Grassi Pauletti

Since the sixties, movies such as “2001: A Space Odyssey” have familiarized us with the idea of com-puters that can speak and hear just as a human being does. Automatic speech recogni-tion (ASR) is the technol-ogy that allows machines to interpret human sp ...

2005

A Frequency-Domain Silence Noise Model

Guillaume Lathoud, Bertrand Mesot

This paper proposes a simple, computationally efficient 2-mixture model approach to discrimination between speech and background noise. It is directly derived from observations on real data, and can be used in a fully unsupervised manner, with the EM algor ...

IDIAP2005

A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR

Guillaume Lathoud, Bertrand Mesot

2005

General state-space models

Thomas Gsponer

In time series analysis state-space models provide a wide and flexible class. The basic idea is to describe an unobservable phenomenon of interest on the basis of noisy data. The first constituent of such a model is the so-called state equation, which char ...

EPFL2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Hervé Bourlard

This paper investigates a new approach to perform simultaneous speech and speaker recognition. The likelihood estimated by a speaker identification system is combined with the posterior probability estimated by the speech recognizer. So, the joint posterio ...

IDIAP2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Hervé Bourlard

2004

Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Guillaume Lathoud

Speech-based command interfaces are becoming more and more common in cars. Applications include automatic dialog systems for hands-free phone calls as well as more advanced features such as navigation systems. However, interferences, such as speech from th ...

IDIAP2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

In this paper, we introduce a new noise robust representation of speech signal obtained by locating points of potential importance in the spectrogram, and parameterizing the activity of time-frequency pattern around those points. These features are referre ...

2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR

Hervé Bourlard, Hemant Misra, Shajith Ikbal

IDIAP2004