Posterior Features Applied to Speech Recognition Tasks with Limited Training Data

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

This paper describes an approach where posterior-based features are applied in those recognition tasks where the amount of training data is insufficient to obtain a reliable estimate of the speech variability. A template matching approach is considered in this paper where posterior features are obtained from a MLP trained on an auxiliary database. Thus, the speech variability present in the features is reduced by applying the speech knowledge captured on the auxiliary database. When compared to state-of-the-art systems, this approach outperforms acoustic-based techniques and obtains comparable results to grapheme-based approaches. Moreover, the proposed method can be directly combined with other posterior-based HMM systems. This combination successfully exploits the complementarity between templates and parametric models.

Posterior Features Applied to Speech Recognition Tasks with Limited Training Data

Graph Chatbot

Chat with Graph Search

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models

Explainable Face Verification via Feature-Guided Gradient Backpropagation

Sparse Autoencoders for Speech Modeling and Recognition

Explainable Face Verification via Feature-Guided Gradient Backpropagation

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models

Sparse Autoencoders for Speech Modeling and Recognition