Fast keyword detection with sparse time-frequency models

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

We address the problem of keyword spotting in continuous speech streams when training and testing conditions can be different. We propose a keyword spotting algorithm based on sparse representation of speech signals in a time-frequency feature space. The training speech elements are jointly represented in a common subspace built on simple basis functions. The subspace is trained in order to capture the common time-frequency structures from different occurrences of the keywords to be spotted. The keyword spotting algorithm then employs a sliding window mechanism on speech streams. It computes the contribution of successive speech segments in the subspace of interest and evaluates the similarity with the training data. Experimental results on the TIMIT database show the effectiveness and the noise resilience of the low complexity spotting algorithm.

Fast keyword detection with sparse time-frequency models

Graph Chatbot

Chat with Graph Search

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models

Statistical Inference for Inverse Problems: From Sparsity-Based Methods to Neural Networks

Sparse Autoencoders for Speech Modeling and Recognition

Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models

Statistical Inference for Inverse Problems: From Sparsity-Based Methods to Neural Networks

Sparse Autoencoders for Speech Modeling and Recognition