Publication

Keyword Detection for Spontaneous Speech

Related publications (32)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Speech recognition with speech synthesis models by marginalising over decision tree leaves

John David Scott Dines, Hui Liang, Lakshmi Babu Saheer

There has been increasing interest in the use of unsupervised adaptation for the personalisation of text-to-speech (TTS) voices, particularly in the context of speech-to-speech translation. This requires that we are able to generate adaptation transforms f ...

Idiap2009

Visual feature analysis for audio-visual speech recognition

Ivana Arsic de Heras Ciechomska

Humans perceive their surrounding environment in a multimodal manner by using multi-sensory inputs combined in a coordinated way. Various studies in psychology and cognitive science indicate the multimodal nature of human speech production and perception. ...

EPFL2008

Efficient integration of automated speech recognition in the framework of dialogue-based vocal systems

In this work, we propose different strategies for efficiently integrating an automated speech recognition module in the framework of a dialogue-based vocal system. The aim is the study of different ways leading to the improvement of the quality and robustn ...

EPFL2005

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

This paper presents overview of an online audio indexing system, which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives ...

2004

An Online Audio Indexing System

Hervé Bourlard, Jitendra Ajmera

IDIAP2003

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

IDIAP2003

A Pragmatic View of the Application of HMM2 for ASR

Hervé Bourlard, Samy Bengio, Katrin Weber

This report investigates the HMM2 approach recently introduced in the framework of automatic speech recognition. HMM2 can be seen as a mixture of HMMs, where a conventional primary HMM (processing a time series of speech data) is supported on a lower level ...

IDIAP2001

Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering

Sacha Krstulovic

Constraints related to the Distinctive Regions and Modes (DRM) speech production model are incorporated in the framework of speech analysis by inverse filtering. It is shown that the analogy between Auto-Regressive modeling and acoustic models based on aco ...

IDIAP1998