Concept

Mel-frequency cepstrum

Related publications (84)

Neural VTLN for Speaker Adaptation in TTS

Vocal tract length normalisation (VTLN) is well established as a speaker adaptation technique that can work with very little adaptation data. It is also well known that VTLN can be cast as a linear transform in the cepstral domain. Building on this latter ...

2019

Trustworthy speaker recognition with minimal prior knowledge using neural networks

Hannah Muckenhirn

The performance of speaker recognition systems has considerably improved in the last decade. This is mainly due to the development of Gaussian mixture model-based systems and in particular to the use of i-vectors. These systems handle relatively well noise ...

EPFL, 2019

A Bayesian Approach To Inter-Task Fusion For Speaker Recognition

Petr Motlicek, Subhadeep Dey

In i-vector based speaker recognition systems, back-end classifiers are trained to factor out nuisance information and retain only the speaker identity. As a result, variabilities arising due to gender, language and accent (among many others) are suppress ...

IEEE, 2019

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION

Petr Motlicek, Subhadeep Dey

2019

Combining the SNR Spectrum with a Cochlear Model

Philip Neil Garner

The SNR spectrum was previously introduced as a natural consequence of using cepstral normalisation in speech recognition; it is closely related to the articulation index of Fletcher. Motivated initially by a theoretical difficulty in frequency warping, ...

Idiap, 2018

Towards directly modeling raw speech signal for speaker verification using CNNs

Sébastien Marcel, Hannah Muckenhirn

Speaker verification systems traditionally extract and model cepstral features or filter bank energies from the speech signal. In this paper, inspired by the success of neural network-based approaches to model directly raw speech signal for applications su ...

IEEE, 2018

Modified group delay feature based total variability space modelling for speaker recognition

In this paper, modified group delay (MODGD) features are used to model target speakers in the Total Variability Space (TVS) framework for speaker recognition. MODGD based features have been shown to improve speaker recognition performance owing to the abil ...

2015

Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness

Hervé Bourlard, Raphaël Marc Ullmann

Is it possible to predict the intrusiveness of background noise in speech signals as perceived by humans? Such a question is important to the automatic evaluation of speech enhancement systems, including those designed for new wideband speech telephony, an ...

Idiap, 2014

Bias Adaptation for Vocal Tract Length Normalization

Philip Neil Garner, John David Scott Dines, Lakshmi Babu Saheer

Vocal tract length normalisation (VTLN) is a well known rapid adaptation technique. VTLN as a linear transformation in the cepstral domain results in the scaling and translation factors. The warping factor represents the spectral scaling parameter. While, ...

Idiap, 2013

A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation

Mathew Magimai Doss, Chandra Sekhar Seelamantula

We address the classical problem of delta feature computation, and interpret the operation involved in terms of Savitzky-Golay (SG) filtering. Features such as the mel-frequency cepstral coefficients (MFCCs), obtained based on short-time spectra of the spe ...

2013
