Concept

Latency (audio)

Related publications (53)

Integrating audio and vision for robust automatic gender recognition

We propose a multi-modal Automatic Gender Recognition (AGR) system based on audio-visual cues and present its thorough evaluation in realistic scenarios. First, we analyze robustness of different audio and visual features under varying conditions and creat ...

Idiap, 2008

Method to generate multi-channel audio signals from stereo signals

Christof Faller

A perceptually motivated spatial decomposition for two-channel stereo audio signals, capturing the information about the virtual sound stage, is proposed. The spatial decomposition allows to re-synthesize audio signals for playback over other sound systems ...

2007

Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction

Petr Motlicek, Hynek Hermansky, Sriram Ganapathy

Audio coding based on Frequency Domain Linear Prediction (FDLP) uses auto-regressive model to approximate Hilbert envelopes in frequency sub-bands for relatively long temporal segments. Although the basic technique achieves good quality of the reconstructe ...

IDIAP, 2007

Audio Coding Based on Long Temporal Contexts

Petr Motlicek, Hynek Hermansky

We describe novel audio coding technique designed to be utilized at medium bit-rates. Unlike classical state-of-the-art audio coders that are based on short-term spectra, our approach uses relatively long temporal segments of audio signal in critical-band- ...

IDIAP, 2006

Blind Audio-Visual Source Separation Using Sparse Redundant Representations

Pierre Vandergheynst, Gianluca Monaci, Anna Llagostera Casanovas

This report presents a new method to confront the Blind Audio Source Separation (BASS) problem, by means of audio and visual information. In a given mixture, we are able to locate the video sources first and, posteriorly, recover each source signal, only w ...

2006

Distributed Compression in Acoustic Sensor Networks Using Oversampled A/D Conversion

Martin Vetterli, Olivier Roy

We address the problem of distributed compression in acoustic sensor networks. A typical scenario consists of a set of microphones that record a sound source located at some unknown position. The goal then is to convey the corresponding audio signals to a ...

2006

Speech Coding based on Spectral Dynamics

Petr Motlicek, Hynek Hermansky

In this paper we present first experimental results with a novel audio coding technique based on approximating Hilbert envelopes of relatively long segments of audio signal in critical-band-sized sub-bands by autoregressive model. We exploit the generalize ...

2006

Speech Coding based on Spectral Dynamics

Petr Motlicek, Hynek Hermansky

IDIAP, 2006

Adaptive Joint Playout Buffer and FEC Adjustment for Internet Telephony

Jean-Yves Le Boudec, Catherine Boutremans

We develop a joint playout buffer and Forward Error Correction (FEC) adjustment scheme for Internet Telephony, which incorporates the impact of end-to-end delay on the perceived audio quality. We show that it provides better quality than the adjustment sch ...

2003

Delay aspects in Internet telephony

Catherine Boutremans

In this work, we address the transport of high quality voice over the Internet with a particular concern for delays. Transport of interactive audio over IP networks often suffers from packet loss and variations in the network delay (jitter). Forward Error ...

EPFL, 2003