Publications related to Speaker diarization of overlapping speech based on silence distribution in meeting recordings

Posterior-Based Features and Distances in Template Matching for Speech Recognition

The use of large speech corpora in example-based approaches for speech recognition is mainly focused on increasing the number of examples. This strategy presents some difficulties because databases may not provide enough examples for some rare words. In th ...

IDIAP2007

Novel speech processing techniques for robust automatic speech recognition

Vivek Tyagi

The goal of this thesis is to develop and design new feature representations that can improve the automatic speech recognition (ASR) performance in clean as well noisy conditions. One of the main shortcomings of the fixed scale (typically 20-30 ms long ana ...

EPFL2006

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud

Microphone arrays are useful in meeting rooms, where speech needs to be acquired and segmented. For example, automatic speech segmentation allows enhanced browsing experience, and facilitates automatic analysis of large amounts of data. Spontaneous multi-p ...

IDIAP2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud

Microphone arrays are useful in meeting rooms, where speech needs to be acquired and segmented. For example, automatic speech segmentation allows enhanced browsing experience, and facilitates automatic analysis of large amounts of data. Spontaneous multi-p ...

2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

IDIAP2003

Some Emerging Concepts in Speech Recognition.

Hervé Bourlard, Hynek Hermansky

The paper presents a work-in-progress on several emerging concepts in Automatic Speech Recognition (ASR), that are being currently studied at IDIAP. This work can be roughly categorized into three categories: 1) data-guided features, 2) features based on m ...

IDIAP2003

Speaker diarization of overlapping speech based on silence distribution in meeting recordings

Graph Chatbot

Chat with Graph Search

Posterior-Based Features and Distances in Template Matching for Speech Recognition

Novel speech processing techniques for robust automatic speech recognition

Robust audio segmentation

An Online Audio Indexing System

Robust Audio Segmentation

Robust Audio Segmentation

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

An Online Audio Indexing System

Some Emerging Concepts in Speech Recognition.

Robust audio segmentation

Robust Audio Segmentation

Robust Audio Segmentation

Posterior-Based Features and Distances in Template Matching for Speech Recognition

Novel speech processing techniques for robust automatic speech recognition

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

An Online Audio Indexing System

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

An Online Audio Indexing System

Some Emerging Concepts in Speech Recognition.