Publication

Low-latency speaker spotting with online diarization and detection

Publications associées (39)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION

Petr Motlicek, Subhadeep Dey

This paper presents Subspace Gaussian Mixture Model (SGMM) approach employed as a probabilistic generative model to estimate speaker vector representations to be subsequently used in the speaker verification task. SGMMs have already been shown to significa ...

Idiap2015

The open source dynamics in geospatial research and education

Stéphane Joost

Peer reviewing is one of the core processes of science. While the typical blind system helps to improve original submissions, there are opportunities for academic publishing to learn from open source practices (commits, bug reports, feature requests, docum ...

2014

Privacy-Sensitive Audio Features for Conversational Speech Processing

Sree Hari Krishnan Parthasarathi

The work described in this thesis takes place in the context of capturing real-life audio for the analysis of spontaneous social interactions. Towards this goal, we wish to capture conversational and ambient sounds using portable audio recorders. Analysis ...

EPFL2011

Privacy-Sensitive Audio Features for Conversational Speech Processing

Sree Hari Krishnan Parthasarathi

Ecole Polytechnique Fédérale de Lausanne2011

Robustness of Phase based Features for Speaker Recognition

Sree Hari Krishnan Parthasarathi

This paper demonstrates the robustness of group-delay based features for speech processing. An analysis of group delay functions is presented which show that these features retain formant structure even in noise. Furthermore, a speaker verification task pe ...

2009

Robustness of Phase based Features for Speaker Recognition

Sree Hari Krishnan Parthasarathi

Idiap2009

Managing competing Communities of Practice: The impact of open source and knowledge-bridging on system-level learning

In this article we explore how learning across competing Communities of Practice (CoP) may benefit from actively managed processes, namely: building an explicit knowledge repository (open source), and setting incentives to crossing boundaries (knowledge-br ...

2007

Discriminative Keyword Spotting

Samy Bengio, David Grangier

This paper proposes a new approach for keyword spotting, which is not based on HMMs. The proposed method employs a new discriminative learning procedure, in which the learning phase aims at maximizing the area under the ROC curve, as this quantity is the m ...

2007

Juicer: A Weighted Finite-State Transducer speech decoder

John David Scott Dines, Darren Moore

A major component in the development of any speech recognition system is the decoder. As task complexities and, consequently, system complexities have continued to increase the decoding problem has become an increasingly significant component in the overal ...

IDIAP2006

Juicer: A Weighted Finite-State Transducer speech decoder

John David Scott Dines, Darren Moore

2006