Concept

Wireless speaker

Related publications (23)

Template-matching for text-dependent speaker verification

In the last decade, i-vector and Joint Factor Analysis (JFA) approaches to speaker modeling have become ubiquitous in the area of automatic speaker recognition. Both of these techniques involve the computation of posterior probabilities, using either Gauss ...

2017

SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS

Hervé Bourlard, Petr Motlicek

Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudinal TV show data s ...

IEEE2016

Monitoring of plastics in freshwater environments in Switzerland

Florian Faure

The German Environment Agency (UBA) and the German Federal Institute of Hydrology (BfG) organ- ised a conference on plastics in freshwater environments on behalf of the Federal Ministry for the Environment, Nature Conservation, Building and Nuclear Safety ...

2016

Modified group delay feature based total variability space modelling for speaker recognition

In this paper, modified group delay (MODGD) features are used to model target speakers in the Total Variability Space (TVS) framework for speaker recognition. MODGD based features have been shown to improve speaker recognition performance owing to the abil ...

2015

Comparison of Two Methods for Unsupervised Person Identification in TV Shows

Jean-Marc Odobez, Paul Gay

We address the task of identifying people appearing in TV shows. The target persons are all people whose identity is said or written, like the journalists and the well known people, as politicians, athletes, celebrities, etc. In our approach, overlaid name ...

2014

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics

Sébastien Marcel, Laurent El Shafey

The MOBIO database provides a challenging test-bed for speaker and face recognition systems because it includes voice and face samples as they would appear in forensic scenarios. In this paper, we investigate uni-modal and bi-modal multi-algorithm fusion u ...

2013

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics

Sébastien Marcel, Laurent El Shafey

Idiap2013

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data

Sébastien Marcel, Norman Hoon Thian Poh, Timothy Cootes

This paper presents a novel fully automatic bi-modal, face and speaker, recognition system which runs in real-time on a mobile phone. The implemented system runs in real-time on a Nokia N900 and demonstrates the feasibility of performing both automatic fac ...

2012

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data

Sébastien Marcel, Norman Hoon Thian Poh, Timothy Cootes

Idiap2012

Lower and upper bounds for approximation of the Kullback-Leibler divergence between Gaussian Mixture Models

Jean-Philippe Thiran, Finnian Paul Kelly

Many speech technology systems rely on Gaussian Mixture Models (GMMs). The need for a comparison between two GMMs arises in applications such as speaker verification, model selection or parameter estimation. For this purpose, the Kullback-Leibler (KL) dive ...

Ieee2012

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.