Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Conventional features in automatic recognition of speech describe instantaneous shape of a short-time spectrum of speech. The TRAP-TANDEM features describe likelihoods of sub-word classess at a given time instant, derived from temporal trajectories of band ...
The paper presents analyses, modifications, and first experiments with a new nonsense syllables database. Results of preliminary experiments with phoneme recognition are given and discussed. ...
The temporal trajectories of the spectral energy in auditory critical bands over 250~ms segments are approximated by an all-pole model, the time-domain dual of conventional linear prediction. This quarter-second auditory spectro-temporal pattern is further ...
The temporal trajectories of the spectral energy in auditory critical bands over 250~ms segments are approximated by an all-pole model, the time-domain dual of conventional linear prediction. This quarter-second auditory spectro-temporal pattern is further ...
The paper argues on examples of selected past works that stochastic and knowledge-based approaches do not contradict each other. Frequency resolution of human hearing is decreasing with increasing frequency. Spectral basis designed for optimal discriminati ...
Recently, a nonlinear transformation of autocorrelation coefficients named Phase AutoCorrelation (PAC) coefficients has been considered for feature extraction \cite{ikbal03}. PAC based features show improved robustness to additive noise as a result of two ...
Conventional features in automatic recognition of speech describe instantaneous shape of a short-time spectrum of speech. The TRAP-TANDEM features describe likelihoods of sub-word classess at a given time instant, derived from temporal trajectories of band ...
Methods to improve noise robustness of speech recognition systems often result in degradation of recognition performance for clean speech. Recently proposed Phase AutoCorrelation (PAC) \cite{ikbal03,ikbal03a} based features, showing noticeable improvement ...
The paper presents analyses, modifications, and first experiments with a new nonsense syllables database. Results of preliminary experiments with phoneme recognition are given and discussed. ...
Recently, a nonlinear transformation of autocorrelation coefficients named Phase AutoCorrelation (PAC) coefficients has been considered for feature extraction \cite{ikbal03}. PAC based features show improved robustness to additive noise as a result of two ...