SNR Features for Automatic Speech Recognition

When combined with cepstral normalisation techniques, the features normally used in Automatic Speech Recognition are based on Signal-to-Noise Ratio (SNR). We show that calculating SNR from the outset, rather than relying on cepstral normalisation to produce it, gives features with a number of practical and mathematical advantages over power-spectrum-based ones. In a detailed analysis, we derive Maximum Likelihood and Maximum a Posteriori estimates for SNR-based features, and show that they can outperform more conventional ones, especially when subsequently combined with cepstral variance normalisation. We further show anecdotal evidence that SNR-based features lend themselves well to noise estimates based on low-energy envelope tracking.
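To make the idea of computing SNR "from the outset" concrete, the following is a minimal sketch, not the estimator derived in the paper. It assumes per-frame power spectra are already available, uses a crude per-bin minimum as a stand-in for low-energy envelope tracking, and applies the floored subtraction max(gamma - 1, 0) that is the usual Maximum Likelihood estimate under a Gaussian spectral model. The function names `noise_floor` and `snr_features` and the `floor` parameter are hypothetical.

```python
import math


def noise_floor(frames):
    """Crude per-bin noise estimate: the minimum power seen in each bin.

    Assumption: a stand-in for low-energy envelope tracking, not the
    tracker used in the paper.
    """
    return [min(column) for column in zip(*frames)]


def snr_features(frames, noise, floor=1e-10):
    """Return log(1 + snr) for every frame and frequency bin.

    Assumption: snr is the floored estimate max(gamma - 1, 0), where
    gamma is the ratio of observed power to estimated noise power.
    """
    features = []
    for frame in frames:
        row = []
        for power, noise_power in zip(frame, noise):
            gamma = power / max(noise_power, floor)  # observed power over noise power
            snr = max(gamma - 1.0, 0.0)              # never negative
            row.append(math.log1p(snr))              # same as log(max(gamma, 1))
        features.append(row)
    return features
```

With this choice a bin whose power equals the noise estimate maps to a feature value of zero, so the features are expressed relative to the noise level before any cepstral normalisation is applied.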
