Publication

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Publications associées (55)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms

Daniel Gatica-Perez, Petr Motlicek

The goal of this work is to provide robust and accurate speech detection for automatic speech recognition (ASR) in meeting room settings. The solution is based on computing long-term modulation spectrum, and examining specific frequency range for dominant ...

IDIAP2006

Model Adaptation for Sentence Unit Segmentation from Speech

Sébastien Cuendet

The sentence segmentation task is a classification task that aims at inserting sentence boundaries in a sequence of words. One of the applications of sentence segmentation is to detect the sentence boundaries in the sequence of words that is output by an a ...

IDIAP2006

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcr ...

2006

The segmentation of multi-channel meeting recordings for automatic speech recognition

John David Scott Dines

IDIAP2006

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Daniel Gatica-Perez, Darren Moore, Silèye Oumar Ba

Close-talk headset microphones have been traditionally used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio -needed for recognition tasks- than single distant microphones. However, in multi-party ...

IDIAP2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array

Daniel Gatica-Perez, Darren Moore, Silèye Oumar Ba

2005

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Samy Bengio

This paper presents an attempt at assessing empirically how a state-of-the-art text-independent speaker verification system behaves when confronted to imposting attempts from a professional imitator who perfectly knows how to imitate in particular the clie ...

IDIAP2005

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

École Polytechnique Fédérale de Lausanne2004