Publications related to Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

The goal of this work is to provide robust and accurate speech detection for automatic speech recognition (ASR) in meeting room settings. The solution is based on computing long-term modulation spectrum, and examining specific frequency range for dominant ...

IDIAP2006

Model Adaptation for Sentence Unit Segmentation from Speech

Sébastien Cuendet

The sentence segmentation task is a classification task that aims at inserting sentence boundaries in a sequence of words. One of the applications of sentence segmentation is to detect the sentence boundaries in the sequence of words that is output by an a ...

IDIAP2006

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcr ...

2006

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcr ...

IDIAP2006

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Daniel Gatica-Perez, Darren Moore, Silèye Oumar Ba

Close-talk headset microphones have been traditionally used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio -needed for recognition tasks- than single distant microphones. However, in multi-party ...

IDIAP2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Daniel Gatica-Perez, Darren Moore, Silèye Oumar Ba

Close-talk headset microphones have been traditionally used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio -needed for recognition tasks- than single distant microphones. However, in multi-party ...

2005

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Samy Bengio

This paper presents an attempt at assessing empirically how a state-of-the-art text-independent speaker verification system behaves when confronted to imposting attempts from a professional imitator who perfectly knows how to imitate in particular the clie ...

IDIAP2005

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Graph Chatbot

Chat with Graph Search

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Model Adaptation for Sentence Unit Segmentation from Speech

The segmentation of multi-channel meeting recordings for automatic speech recognition

The segmentation of multi-channel meeting recordings for automatic speech recognition

Robust audio segmentation

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Robust Audio Segmentation

Robust Audio Segmentation

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Model Adaptation for Sentence Unit Segmentation from Speech

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

The segmentation of multi-channel meeting recordings for automatic speech recognition

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Robust Audio Segmentation

The segmentation of multi-channel meeting recordings for automatic speech recognition

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Robust audio segmentation

Robust Audio Segmentation