BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION

The standard approach to speaker verification is to extract cepstral features from the speech spectrum and model them by generative or discriminative techniques. We propose a novel approach where a set of client-specific binary features carrying maximal discriminative information specific to the individual client are estimated from an ensemble of pair-wise comparisons of frequency components in magnitude spectra, using Adaboost algorithm. The final classifier is a simple linear combination of these selected features. Experiments on the XM2VTS database strictly according to a standard evaluation protocol have shown that although the proposed framework yields comparatively lower performance on clean speech, it significantly outperforms the state-of-the-art MFCC-GMM system in mismatched conditions with training on clean speech and testing on speech corrupted by four types of additive noise from the standard Noisex-92 database.

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION

Graph Chatbot

Chat with Graph Search

Hitting with Probability One for Stochastic Heat Equations with Additive Noise

Does powder averaging remove dispersion bias in diffusion MRI diameter estimates within real 3D axonal architectures?

Propagation of singularities for the stochastic wave equation

Hitting with Probability One for Stochastic Heat Equations with Additive Noise

Does powder averaging remove dispersion bias in diffusion MRI diameter estimates within real 3D axonal architectures?

Propagation of singularities for the stochastic wave equation