Concept

Magnétophone

Publications associées (52)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcr ...

2006

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

IDIAP2006

Improved Time Delay Analysis/Synthesis for Parametric Stereo Audio Coding

Christof Faller, Christophe Tournery

For parametric stereo and multi-channel audio coding, it has been proposed to use level difference, time difference, and coherence cues between audio channels to represent the perceptual spatial features of stereo and multi-channel audio signals. In practi ...

2006

Playback Delay and Buffering Optimization in Scalable Video Broadcasting

Pascal Frossard, Jean-Paul Wagner

This paper addresses the problem of optimizing the playback delay experienced by a population of heterogeneous clients, in video streaming applications. We consider a typical broadcast scenario, where clients subscribe to different portions of a scalable v ...

2005

Playback Delay Optimization in Scalable Video Streaming

Pascal Frossard, Jean-Paul Wagner

2005

Clustering And Segmenting Speakers And Their Locations In Meetings

Guillaume Lathoud, Jitendra Ajmera

This paper presents a new approach toward automatic annotation of meetings in terms of speaker identities and their locations. This is achieved by segmenting the audio recordings using two independent sources of information : magnitude spectrum analysis an ...

2004

Clustering And Segmenting Speakers And Their Locations In Meetings

Guillaume Lathoud, Jitendra Ajmera

IDIAP2003

The VidTIMIT Database

This communication describes the multi-modal VidTIMIT database, which can be useful for research involving mono- or multi-modal speech recognition or person authentication. It is comprised of video and corresponding audio recordings of 43 volunteers, recit ...

IDIAP2002

The IDIAP Smart Meeting Room

Darren Moore

The IDIAP Smart Meeting Room is a meeting room equipped with synchronised, multi-channel audio-visual recording facilities. This document presents a detailed description of the room with particular emphasis on the acquisition equipment and the components u ...

IDIAP2002

Comparison of the recording dynamics of phenanthrenequinone-doped poly(methyl methacrylate) materials

Demetri Psaltis

The comparison between the NCTU and Caltech PQ-PMMA material shows that the difference in their behavior lies in the different concentration of residual MMA in the samples. Experimental evidence shows that during recording, PQ molecules attach to MMA but n ...

2001