Publication

Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation

Towards mixed language speech recognition systems

Hervé Bourlard, David Imseng

Multilingual speech recognition obviously involves numerous research challenges, including common phoneme sets, adaptation on a limited amount of training data, as well as mixed language recognition (common in many countries, like Switzerland). In this latte ...

2010

Towards mixed language speech recognition systems

Hervé Bourlard, David Imseng

Idiap, 2010

Multimodal feature extraction and fusion for audio-visual speech recognition

Mihai Gurban

Multimodal signal processing analyzes a physical phenomenon through several types of measures, or modalities. This leads to the extraction of higher-quality and more reliable information than that obtained from single-modality signals. The advantage is two ...

EPFL, 2009

Enhanced Phone Posteriors for Improving Speech Recognition Systems

Hervé Bourlard, Hamed Ketabdar

Using phone posterior probabilities has been increasingly explored for improving automatic speech recognition (ASR) systems. In this paper, we propose two approaches for hierarchically enhancing these phone posteriors, by integrating long acoustic context, ...

IDIAP, 2008

Exploiting Contextual Information for Improved Phoneme Recognition

Hynek Hermansky, Joel Praveen Pinto

In this paper, we investigate the significance of contextual information in a phoneme recognition system using the hidden Markov model - artificial neural network paradigm. Contextual information is probed at the feature level as well as at the output of t ...

2008

Robust overlapping speech recognition based on neural networks

John David Scott Dines, Weifeng Li

We address issues for improving hands-free speech recognition performance in the presence of multiple simultaneous speakers using multiple distant microphones. In this paper, a log spectral mapping is proposed to estimate the log mel-filterbank outputs of ...

IDIAP, 2007

Exploiting Contextual Information for Improved Phoneme Recognition

Hynek Hermansky, Joel Praveen Pinto

IDIAP, 2007

Detection and Recognition of Number Sequences in Spoken Utterances

Guillermo Aradilla, Jitendra Ajmera

In this paper we investigate the detection and recognition of sequences of numbers in spoken utterances. This is done in two steps: first, the entire utterance is decoded assuming that only numbers were spoken. In the second step, non-number segments (garb ...

2007

Detection and Recognition of Number Sequences in Spoken Utterances

Guillermo Aradilla, Jitendra Ajmera

IDIAP, 2007

Multimedia event modelling and recognition

Mark Barnard

The recognition of events in multimedia data is a challenging area of research. The growth in the amount of multimedia data being produced and stored increases the need for systems capable of automatically analysing this data. This analysis can aid in effi ...

EPFL, 2005