Publication

On the Combination of Speech and Speaker Recognition

Related publications (99)

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Samy Bengio

In this paper, we show that the hinge loss can be interpreted as the neg-log-likelihood of a semi-parametric model of posterior probabilities. From this point of view, SVMs represent the parametric component of a semi-parametric model fitted by a maximum a ...

2005

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream into acoustically homogeneous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL, 2005

Developing and Enhancing Posterior Based Speech Recognition Systems

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

Local state or phone posterior probabilities are often investigated as local scores (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., "Tandem") to improve speech recognition systems. In this paper, we present initial results towa ...

2005

Developing and Enhancing Posterior Based Speech Recognition Systems

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

IDIAP, 2005

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition

Hynek Hermansky

The paper presents an alternative approach to automatic recognition of speech in which each targeted word is classified by a separate binary classifier against all other sounds. No time alignment is done. To build a recognizer for N words, N parallel binar ...

IDIAP, 2005

Using more informative posterior probabilities for speech recognition

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial investigations towards boosting posterior probability based speech recognition systems by estimating more informative posteriors taking into account acoustic context (e.g., the whole utterance), as well as possible prior i ...

IDIAP, 2005

Joint Speech and Speaker Recognition

The goal of the present thesis was to investigate and optimize different approaches towards User-Customized Password Speaker Verification (UCP-SV) systems. In such systems, users can choose their own passwords, which will be subsequently used for verificat ...

IDIAP, 2005

Joint Speech and Speaker Recognition

École Polytechnique Fédérale de Lausanne, Computer Science Department, 2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial results towards boosting posterior based speech recognition systems by estimating more informative posteriors using multiple streams of features and taking into account acoustic context (e.g., as available in the whole utt ...

2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

IDIAP, 2005