Publication

On the Combination of Speech and Speaker Recognition

Related publications (99)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Samy Bengio

In this paper, we show that the hinge loss can be interpreted as the neg-log-likelihood of a semi-parametric model of posterior probabilities. From this point of view, SVMs represent the parametric component of a semi-parametric model fitted by a maximum a ...

2005

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Developing and Enhancing Posterior Based Speech Recognition Systems

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

Local state or phone posterior probabilities are often investigated as local scores (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., ``Tandem'') to improve speech recogni tion systems. In this paper, we present initial results towa ...

2005

Developing and Enhancing Posterior Based Speech Recognition Systems

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

IDIAP2005

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition

Hynek Hermansky

The paper presents an alternative approach to automatic recognition of speech in which each targeted word is classified by a separate binary classifier against all other sounds. No time alignment is done. To build a recognizer for N words, N parallel binar ...

IDIAP2005

Using more informative posterior probabilities for speech recognition

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial investigations towards boosting posterior probability based speech recognition systems by estimating more informative posteriors taking into account acoustic context (e.g., the whole utterance), as well as possible prior i ...

IDIAP2005

Joint Speech and Speaker Recognition

The goal of the present thesis was to investigate and optimize different approaches towards User-Customized Password Speaker Verification (UCP-SV) systems. In such systems, users can choose their own passwords, which will be subsequently used for verificat ...

IDIAP2005

Joint Speech and Speaker Recognition

École Polytechnique Fédérale de Lausanne, Computer Science Department2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

In this paper, we present initial results towards boosting posterior based speech recognition systems by estimating more informative posteriors using multiple streams of features and taking into account acoustic context (e.g., as available in the whole utt ...

2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

IDIAP2005