Publications related to Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations

Robust audio segmentation

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Developing and Enhancing Posterior Based Speech Recognition Systems

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

Local state or phone posterior probabilities are often investigated as local scores (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., ``Tandem'') to improve speech recogni tion systems. In this paper, we present initial results towa ...

2005

Developing and Enhancing Posterior Based Speech Recognition Systems

Hervé Bourlard, Samy Bengio, Hamed Ketabdar

Local state or phone posterior probabilities are often investigated as local scores (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., ``Tandem'') to improve speech recogni tion systems. In this paper, we present initial results towa ...

IDIAP2005

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Samy Bengio

In this paper, we show that the hinge loss can be interpreted as the neg-log-likelihood of a semi-parametric model of posterior probabilities. From this point of view, SVMs represent the parametric component of a semi-parametric model fitted by a maximum a ...

IDIAP2005

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Samy Bengio

In this paper, we show that the hinge loss can be interpreted as the neg-log-likelihood of a semi-parametric model of posterior probabilities. From this point of view, SVMs represent the parametric component of a semi-parametric model fitted by a maximum a ...

2005

Towards using hierarchical posteriors for flexible automatic speech recognition systems

Hervé Bourlard, Samy Bengio, Bertrand Mesot

Local state (or phone) posterior probabilities are often investigated as local classifiers (e.g., hybrid HMM/ANN systems) or as transformed acoustic features (e.g., ``Tandem'') towards improved speech recognition systems. In this paper, we present initial ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Hervé Bourlard

This paper investigates a new approach to perform simultaneous speech and speaker recognition. The likelihood estimated by a speaker identification system is combined with the posterior probability estimated by the speech recognizer. So, the joint posterio ...

IDIAP2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Hervé Bourlard

This paper investigates a new approach to perform simultaneous speech and speaker recognition. The likelihood estimated by a speaker identification system is combined with the posterior probability estimated by the speech recognizer. So, the joint posterio ...

2004

Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations

Graph Chatbot

Chat with Graph Search

Robust audio segmentation

Developing and Enhancing Posterior Based Speech Recognition Systems

Developing and Enhancing Posterior Based Speech Recognition Systems

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Towards using hierarchical posteriors for flexible automatic speech recognition systems

Robust Audio Segmentation

Robust Audio Segmentation

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Robust audio segmentation

Robust Audio Segmentation

Robust Audio Segmentation

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Developing and Enhancing Posterior Based Speech Recognition Systems

Developing and Enhancing Posterior Based Speech Recognition Systems

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

Towards using hierarchical posteriors for flexible automatic speech recognition systems