Publication

A 3-D Audio-Visual Corpus of Affective Communication

Related publications (77)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Unsupervised Location-Based Segmentation of Multi-Party Speech

Jean-Marc Odobez, Guillaume Lathoud

Accurate detection and segmentation of spontaneous multi-party speech is crucial for a variety of applications, including speech acquisition and recognition, as well as higher-level event recognition. However, the highly sporadic nature of spontaneous spee ...

2004

Using physiological measures for emotional assessment: A computer-aided tool for cognitive and behavioral therapy

Daniel Thalmann, Bruno Herbelin, Olivier Renault, Helena Stephanie Grillon Le Gallennec

In the context of cognitive and behavioural therapies, the use of immersion technologies to replace classical exposure often improves the therapeutic process. As it is necessary to validate the efficiency of such a technique, both therapists and VR special ...

2004

Noisy Text Categorization

Alessandro Vinciarelli

This work presents categorization experiments performed over noisy texts. By noisy it is meant any text obtained through an extraction process (affected by errors) from media other than digital texts (e.g. transcriptions of speech recordings extracted with ...

IDIAP2004

HMM mixtures (HMM2) for robust speech recognition

Katrin Weber

State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov models (HMMs) for the modeling of temporal sequences of feature vectors extracted from the speech signal. At the level of each HMM state, Gaussian mixture m ...

EPFL2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

Ecole Polytechnique Federale de Lausanne2003

HMM Mixtures (HMM2) for Robust Speech Recognition

Katrin Weber

IDIAP2003

Information Retrieval on Noisy Text

Hervé Bourlard, David Grangier, Alessandro Vinciarelli

Spoken Document Retrieval (SDR) consists in retrieving segments of a speech database that are relevant to a query. The state-of-the-art approach to the SDR problem consists in transcribing the speech data into digital text before applying common Informatio ...

IDIAP2003

Increasing Speech Recognition Noise Robustness with HMM2

Hervé Bourlard, Samy Bengio, Katrin Weber

The purpose of this paper is to investigate the behavior of HMM2 models for the recognition of noisy speech. It has previously been shown that HMM2 is able to model dynamically important structural information inherent in the speech signal, often correspon ...

2002

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition

Hervé Bourlard, Astrid Hagen

In human perception, the availability of context enhances recognition and renders it more robust to noise. Even if not all phonemes in a word (or words in a sentence etc.) are correctly perceived, humans can fill in missing parts with the help of cues from ...

IDIAP2001

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition

Hervé Bourlard, Astrid Hagen

2001