Publication

Robust Audio Segmentation

Publications associées (168)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

On the Combination of Speech and Speaker Recognition

Hervé Bourlard

This paper investigates an approach that maximizes the joint posterior probabil ity of the pronounced word and the speaker identity given the observed data. This probability can be expressed as a product of the posterior probability of the pronounced word ...

2003

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

IDIAP2003

Finding Structure in Home Videos by Probabilistic Hierarchical Clustering

Daniel Gatica-Perez

Accessing, organizing, and manipulating home videos present technical challenges due to their unrestricted content and lack of storyline. In this paper, we present a methodology to discover cluster structure in home videos, which uses video shots as the un ...

2003

Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering

In many signal such speech, bio-signals, protein chains, etc. there is a dependency between consecutive vectors. As the dependency is limited in duration such data can be called as Piecewise-Dependent- Data (PDD). In clustering it is frequently needed to m ...

2003

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition

Hervé Bourlard, Hynek Hermansky, Hemant Misra, Shajith Ikbal

Methods to improve noise robustness of speech recognition systems often result in degradation of recognition performance for clean speech. Recently proposed Phase AutoCorrelation (PAC) \cite{ikbal03,ikbal03a} based features, showing noticeable improvement ...

IDIAP2003

Robust Speaker Change Detection

Hervé Bourlard, Jitendra Ajmera

Most commonly used criteria for speaker change detection like log likelihood ratio (LLR) and Bayesian information criterion (BIC) have an adjustathreshold/penalty parameter to make speaker change decisions. These parameters robust to different acoustic con ...

2003

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Hervé Bourlard

This paper investigates the use of multiple pronunciations modeling for User-Customized Password Speaker Verification (UCP-SV). The main characteristic of the UCP-SV is that the system does not have any {\it a priori} knowledge about the password used by t ...

IDIAP2003

Robust HMM-Based Speech/Music Segmentation

Hervé Bourlard, Jitendra Ajmera

In this paper we present a new approach towards high performance speech/music segmentation on realistic tasks related to the automatic transcription of broadcast news. In the approach presented here, the local probability density function (PDF) estimators ...

2002

Self-Organizing-Maps With BIC For Speaker Clustering

A new approach is presented for clustering the speakers from unlabeled and unsegmented conversation, when the number of speakers is unknown. In this approach, each speaker is modeled by a Self- Organizing-Map (SOM). For estimation of the number of clusters ...

IDIAP2002

Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering

Daniel Gatica-Perez

IDIAP2002