Publications related to Robustness of Phase based Features for Speaker Recognition

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Hynek Hermansky, Sriram Ganapathy, Samuel Thomas

Frequency Domain Linear Prediction (FDLP) provides an efficient way to represent temporal envelopes of a signal using auto-regressive models. For the input speech signal, we use FDLP to estimate temporal trajectories of sub-band energy by applying linear p ...

2008

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Hynek Hermansky, Sriram Ganapathy, Samuel Thomas

Frequency Domain Linear Prediction (FDLP) provides an efficient way to represent temporal envelopes of a signal using auto-regressive models. For the input speech signal, we use FDLP to estimate temporal trajectories of sub-band energy by applying linear p ...

IDIAP2008

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

2006

Speaker recognition in noisy environments using auxiliary information and Bayesian networks

Speaker recognition systems achieve acceptable performance in controlled laboratory conditions. However, in real-life environments, the performance of a speaker recognition system degrades drastically, the principal cause being the mismatch that exists bet ...

EPFL2006

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

IDIAP2005

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Samy Bengio

This paper presents an attempt at assessing empirically how a state-of-the-art text-independent speaker verification system behaves when confronted to imposting attempts from a professional imitator who perfectly knows how to imitate in particular the clie ...

IDIAP2005

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Hervé Bourlard

This paper investigates the use of multiple pronunciations modeling for User-Customized Password Speaker Verification (UCP-SV). The main characteristic of the UCP-SV is that the system does not have any {\it a priori} knowledge about the password used by t ...

2004

Robustness of Phase based Features for Speaker Recognition

Graph Chatbot

Chat with Graph Search

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Speaker recognition in noisy environments using auxiliary information and Bayesian networks

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Robust audio segmentation

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Robust Audio Segmentation

Robust Audio Segmentation

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Robust audio segmentation

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Robust Audio Segmentation

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Speaker recognition in noisy environments using auxiliary information and Bayesian networks

Robust Audio Segmentation

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification