Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or p ...
Microfluidics and optofluidics have revolutionized high-throughput analysis and chemical synthesis over the past decade. Single molecule imaging has witnessed similar growth, due to its capacity to reveal heterogeneities at high spatial and temporal resolu ...
The standard approach to speaker verification is to extract cepstral features from the speech spectrum and model them by generative or discriminative techniques. We propose a novel approach where a set of client-specific binary features carrying maximal di ...
Cochlear implant-like spectrally reduced speech (SRS) has previously been shown to afford robustness to additive noise. In this paper, it is evaluated in the context of microphone array based automatic speech recognition (ASR). It is compared to and combin ...
On top of the many external perturbations, cellular oscillators also face intrinsic perturbations due the randomness of chemical kinetics. Biomolecular oscillators, distinct in their parameter sets or distinct in their architecture, show different resilien ...
In this paper, we consider the problem of speaker verification as a two-class object detection problem in computer vision, where the object instances are 1-D short-time spectral vectors obtained from the speech signal. More precisely, we investigate the ge ...
High spatial (~cm) and spectral (~MHz) resolution Brillouin sensing is realized with enhanced signal to noise ratio using a pre-activated acoustic field and an optical phase control over the interrogating pulse. Pre-activation of the acoustic field preserv ...
This paper demonstrates the robustness of group delay based features to additive noise. First, we analytically show the robustness of group delay based represen- tations. The analysis makes use of the fact that, for minimum-phase signals, the group delay f ...
Using a transformation based at least in part on a non-simple orthogonal or unitary matrix, data may be transmitted over a data bus in a manner that is resilient to one or more types of signal noise, that does not require a common reference at the transmis ...
Adaptive networks, consisting of a collection of nodes with learning abilities, are well-suited to solve distributed inference problems and to model various types of self-organized behavior observed in nature. One important issue in designing adaptive netw ...