Publications associées à Speech recognition with speech synthesis models by marginalising over decision tree leaves

Keyword Detection for Spontaneous Speech

Hervé Bourlard, Aude Billard, Weifeng Li

This paper presents a system for keyword detection in spontaneous speech. Keywords are predefined through a set of acoustic examples provided by the users. Keyword detection proceeds in two steps: keyword searching and verification. To address the problem ...

2009

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

2006

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Hervé Bourlard, Guillermo Aradilla

In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templ ...

IDIAP2005

Robust audio segmentation

Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

EPFL2005

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

IDIAP2004

Robust Audio Segmentation

Hervé Bourlard, Jitendra Ajmera

Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acoustically homogenous regions, where the rule of homogeneity depends on the task. This thesis aims at developing and investigating efficient, robust and unsup ...

École Polytechnique Fédérale de Lausanne2004

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Hervé Bourlard

This paper investigates the use of multiple pronunciations modeling for User-Customized Password Speaker Verification (UCP-SV). The main characteristic of the UCP-SV is that the system does not have any {\it a priori} knowledge about the password used by t ...

2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

IDIAP2003

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Hervé Bourlard

This paper investigates the use of multiple pronunciations modeling for User-Customized Password Speaker Verification (UCP-SV). The main characteristic of the UCP-SV is that the system does not have any {\it a priori} knowledge about the password used by t ...

IDIAP2003

Speech recognition with speech synthesis models by marginalising over decision tree leaves

Graph Chatbot

Chattez avec Graph Search

Keyword Detection for Spontaneous Speech

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Robust audio segmentation

An Online Audio Indexing System

Robust Audio Segmentation

Robust Audio Segmentation

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

An Online Audio Indexing System

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

An Online Audio Indexing System

Keyword Detection for Spontaneous Speech

An Online Audio Indexing System

Using Pitch as Prior Knowledge in Template-Based Speech Recognition

Robust audio segmentation

Robust Audio Segmentation

Robust Audio Segmentation

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification