Publication

Novel speech processing techniques for robust automatic speech recognition

Related publications (131)

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS

Petr Motlicek, Hynek Hermansky, Sriram Ganapathy, Amrutha Prasad

Although current trends in speech processing consider deep learning through data-driven technologies, many potential applications exhibit lack of training or development data. Therefore, considerably light signal processing techniques are still of interest ...
Idiap2020

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS

Petr Motlicek, Hynek Hermansky, Sriram Ganapathy, Amrutha Prasad

Although current trends in speech processing consider deep learning through data-driven technologies, many potential applications exhibit lack of training or development data. Therefore, considerably light signal processing techniques are still of interest ...
2019

Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility

Hervé Bourlard, Ina Kodrasi, Parvaneh Janbakhshi

Speech intelligibility is an important assessment criterion of the communicative performance of pathological speakers. To assist clinicians in their assessment, time- and cost-efficient automatic intelligibility measures offering a repeatable and reliable ...
2019

Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares

Ina Kodrasi

The multi-channel Wiener filter (MWF) is a well-known multi-microphone speech enhancement technique, aiming at improving the quality of the recorded speech signals in noisy and reverberant environments. Assuming that reverberation and ambient noise can be ...
2019

Combining the SNR Spectrum with a Cochlear Model

Philip Neil Garner

The SNR spectrum was previously introduced as a natural consequence of using cepstral normalisa- tion in speech recognition; it is closely related to the articulation index of Fletcher. Motivated initially by a theoretical difficulty in frequency warping, ...
Idiap2018

Predicting Group Performance in Task-Based Interaction

Catharine Regina Monika Maria Oertel Genannt Bierbach

We address the problem of automatically predicting group performance on a task, using multimodal features derived from the group conversation. These include acoustic features extracted from the speech signal, and linguistic features derived from the conver ...
ASSOC COMPUTING MACHINERY2018

Characterisation of voice quality of Parkinson's disease using differential phonological posterior features

Milos Cernak

Change in voice quality (VQ) is one of the first precursors of Parkinson's disease (PD). Specifically, impacted phonation and articulation causes the patient to have a breathy, husky-semiwhisper and hoarse voice. A goal of this paper is to characterize a V ...
Idiap2017

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.