Publication

End-to-End Acoustic Modeling using Convolutional Neural Networks for HMM-based Automatic Speech Recognition

Related publications (181)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

IDIAP2003

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework

Hervé Bourlard, Jitendra Ajmera

In this paper, we present a new approach towards high performance speech/music discrimination on realistic tasks related to the automatic transcription of broadcast news. In the approach presented here, the (local) Probability Density Function (PDF) estima ...

2003

Robust Speech Recognition and Feature Extraction Using HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber, Shajith Ikbal

This paper presents the theoretical basis and preliminary experimental results of a new HMM model, referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated ...

2003

Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings

Darren Moore

This paper investigates the use of microphone arrays to acquire and recognise speech in meetings. Meetings pose several interesting problems for speech processing, as they consist of multiple competing speakers within a small space, typically around a tabl ...

2003

HMM inference towards flexible speech recognition

One of the difficulties in Automatic Speech Recognizer (ASR) is the pronunciation variability. Each word (modeled by a baseline phonetic transcription in the ASR dictionary) can be pronounced in many different ways depending on many complex qualitative and ...

IDIAP2003

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models

Hervé Bourlard

In this paper, we present a new approach towards user-custom-ized password speaker verification combining the advantages of hybrid HMM/ANN systems, using Artificial Neural Networks (ANN) to estimate emission probabilities of Hidden Markov Models, and Gaus ...

IDIAP2002

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models

Hervé Bourlard

2002

Speaker Normalization using HMM2

Hervé Bourlard, Katrin Weber, Shajith Ikbal

In this paper, we present an HMM2 based method for speaker normalization. Introduced as an extension of Hidden Markov Model (HMM), HMM2 differentiates itself from the regular HMM in terms of the emission density modeling, which is done by a set of state-de ...

IDIAP2002

Speaker Normalization using HMM2

Hervé Bourlard, Katrin Weber, Shajith Ikbal

2002

Speech Processing & Text-Independent Automatic Person Verification

In this communication we first review the human speech production process and feature extraction approaches commonly used in a speaker verification system. Experiments on the telephone speech {NTIMIT} database suggest that the performance degradation of a ...

IDIAP2002