Publication

Segmenting Multiple Concurrent Speakers Using Microphone Arrays

Related publications (41)

Segmenting Multiple Concurrent Speakers Using Microphone Arrays

Guillaume Lathoud, Darren Moore

Speaker turn detection is an important task for many speech processing applications. However, accurate segmentation can be hard to achieve if there are multiple concurrent speakers (overlap), as is typically the case in multi-party conversations. In such c ...

2003

Location Based Speaker Segmentation

Guillaume Lathoud

This paper proposes a technique that segments into speaker turns based on their location, essentially implementing a discrete source tracking system. In many multi-party conversations, such as meetings or teleconferences, the location of participants is re ...

2003

Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings

Darren Moore

This paper investigates the use of microphone arrays to acquire and recognise speech in meetings. Meetings pose several interesting problems for speech processing, as they consist of multiple competing speakers within a small space, typically around a tabl ...

2003

Audio-Visual Speaker Tracking with Importance Particle Filters

Daniel Gatica-Perez, Jean-Marc Odobez, Guillaume Lathoud, Darren Moore

We present a probabilistic methodology for audio-visual (AV) speaker tracking, using an uncalibrated wide-angle camera and a microphone array. The algorithm fuses 2-D object shape and audio information via importance particle filters (I-PFs), allowing for ...

2003

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents an overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

IDIAP, 2003

Small Microphone Array: Algorithms and Hardware

Darren Moore

This report describes the processing algorithms and gives an overview of the hardware for the small microphone array unit in the IM2.RTMAP (Real-time Microphone Array Processing) project. The algorithms include techniques for speech enhancement, speaker lo ...

IDIAP, 2003

Location Based Speaker Segmentation

Guillaume Lathoud

IDIAP, 2002

Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings

Darren Moore

IDIAP, 2002

Audio-Visual Speaker Tracking with Importance Particle Filters

Daniel Gatica-Perez, Jean-Marc Odobez, Guillaume Lathoud, Darren Moore

IDIAP, 2002

R/D optimal linear prediction

Martin Vetterli, Paolo Prandoni

A common technique to extend linear prediction to nonstationary signals is time segmentation: the signal is split into small portions and the modelization is carried out locally. The accuracy of the analysis is, however, dependent on the window size and on ...

2000