Publication

Low-latency speaker spotting with online diarization and detection

Related publications (39)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION

Petr Motlicek, Subhadeep Dey

This paper presents Subspace Gaussian Mixture Model (SGMM) approach employed as a probabilistic generative model to estimate speaker vector representations to be subsequently used in the speaker verification task. SGMMs have already been shown to significa ...

Idiap2015

The open source dynamics in geospatial research and education

Stéphane Joost

Peer reviewing is one of the core processes of science. While the typical blind system helps to improve original submissions, there are opportunities for academic publishing to learn from open source practices (commits, bug reports, feature requests, docum ...

2014

Privacy-Sensitive Audio Features for Conversational Speech Processing

Sree Hari Krishnan Parthasarathi

The work described in this thesis takes place in the context of capturing real-life audio for the analysis of spontaneous social interactions. Towards this goal, we wish to capture conversational and ambient sounds using portable audio recorders. Analysis ...

EPFL2011

Privacy-Sensitive Audio Features for Conversational Speech Processing

Sree Hari Krishnan Parthasarathi

Ecole Polytechnique Fédérale de Lausanne2011

Robustness of Phase based Features for Speaker Recognition

Sree Hari Krishnan Parthasarathi

This paper demonstrates the robustness of group-delay based features for speech processing. An analysis of group delay functions is presented which show that these features retain formant structure even in noise. Furthermore, a speaker verification task pe ...

2009

Robustness of Phase based Features for Speaker Recognition

Sree Hari Krishnan Parthasarathi

Idiap2009

Managing competing Communities of Practice: The impact of open source and knowledge-bridging on system-level learning

In this article we explore how learning across competing Communities of Practice (CoP) may benefit from actively managed processes, namely: building an explicit knowledge repository (open source), and setting incentives to crossing boundaries (knowledge-br ...

2007

Discriminative Keyword Spotting

Samy Bengio, David Grangier

This paper proposes a new approach for keyword spotting, which is not based on HMMs. The proposed method employs a new discriminative learning procedure, in which the learning phase aims at maximizing the area under the ROC curve, as this quantity is the m ...

2007

Juicer: A Weighted Finite-State Transducer speech decoder

John David Scott Dines, Darren Moore

A major component in the development of any speech recognition system is the decoder. As task complexities and, consequently, system complexities have continued to increase the decoding problem has become an increasingly significant component in the overal ...

IDIAP2006

Juicer: A Weighted Finite-State Transducer speech decoder

John David Scott Dines, Darren Moore

2006