Publications related to Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

A multimodal pattern recognition framework for speaker detection

Speaker detection is an important component of a speech-based user interface. Audiovisual speaker detection, speech and speaker recognition or speech synthesis for example find multiple applications in human-computer interaction, multimedia content indexin ...

EPFL2007

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcr ...

2006

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcr ...

IDIAP2006

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Daniel Gatica-Perez, Darren Moore, Silèye Oumar Ba

Close-talk headset microphones have been traditionally used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio -needed for recognition tasks- than single distant microphones. However, in multi-party ...

IDIAP2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Daniel Gatica-Perez, Darren Moore, Silèye Oumar Ba

Close-talk headset microphones have been traditionally used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio -needed for recognition tasks- than single distant microphones. However, in multi-party ...

2005

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Samy Bengio

This paper presents an attempt at assessing empirically how a state-of-the-art text-independent speaker verification system behaves when confronted to imposting attempts from a professional imitator who perfectly knows how to imitate in particular the clie ...

IDIAP2005

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

Unsupervised Location-Based Segmentation of Multi-Party Speech

Jean-Marc Odobez, Guillaume Lathoud

Accurate detection and segmentation of spontaneous multi-party speech is crucial for a variety of applications, including speech acquisition and recognition, as well as higher-level event recognition. However, the highly sporadic nature of spontaneous spee ...

2004

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Graph Chatbot

Chat with Graph Search

A multimodal pattern recognition framework for speaker detection

The segmentation of multi-channel meeting recordings for automatic speech recognition

The segmentation of multi-channel meeting recordings for automatic speech recognition

Robust audio segmentation

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Robust Audio Segmentation

Robust Audio Segmentation

Unsupervised Location-Based Segmentation of Multi-Party Speech

A multimodal pattern recognition framework for speaker detection

Robust Audio Segmentation

The segmentation of multi-channel meeting recordings for automatic speech recognition

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

The segmentation of multi-channel meeting recordings for automatic speech recognition

Robust Audio Segmentation

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Unsupervised Location-Based Segmentation of Multi-Party Speech

Robust audio segmentation

Speech Acquisition in Meetings with an Audio-Visual Sensor Array