Concept

Multimodal sentiment analysis

Related publications (68)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Multimodal feature extraction and fusion for audio-visual speech recognition

Mihai Gurban

Multimodal signal processing analyzes a physical phenomenon through several types of measures, or modalities. This leads to the extraction of higher-quality and more reliable information than that obtained from single-modality signals. The advantage is two ...

EPFL2009

KL Realignment for Speaker Diarization with Multiple Feature Streams

Hervé Bourlard, Fabio Valente, Deepu Vijayasenan

This paper aims at investigating the use of Kullback-Leibler (KL) divergence based realignment with application to speaker diarization. The use of KL divergence based realignment operates directly on the speaker posterior distribution estimates and is comp ...

2009

Multimodal Speaker Localization from Omnidirectional Videos

Jean-Philippe Thiran, Mihai Gurban, Pascal Reuse

The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate the use of a multimodal speaker detection algorithm on audio-visual sequences captured with such a c ...

2009

Object Category Detection using Audio-visual Cues

Barbara Caputo, Jie Luo

Categorization is one of the fundamental building blocks of cognitive systems. Object categorization has traditionally been addressed in the vision domain, even though cognitive agents are intrinsically multimodal. Indeed, biological systems combine severa ...

2008

Visual feature analysis for audio-visual speech recognition

Ivana Arsic de Heras Ciechomska

Humans perceive their surrounding environment in a multimodal manner by using multi-sensory inputs combined in a coordinated way. Various studies in psychology and cognitive science indicate the multimodal nature of human speech production and perception. ...

EPFL2008

Classification with class-independent quality information for biometric verification

Krzysztof Kryszczuk

Biometric identity verification systems frequently face the challenges of non-controlled conditions of data acquisition. Under such conditions biometric signals may suffer from quality degradation due to extraneous, identity-independent factors. It has bee ...

EPFL2008

Design of multimodal dialogue-based systems

Miroslav Melichar

Multimodal dialogue systems integrate advanced (often spoken) language technologies within human-computer interaction methods. Such complex systems cannot be designed without extensive human expertise and systematic design guidelines taking into account th ...

EPFL2008

Discovering Human Routines from Cell Phone Data with Topic Models

Daniel Gatica-Perez, Katayoun Farrahi

We present a framework to automatically discover people's routines from information extracted by cell phones. The framework is built from a probabilistic topic model learned on novel bag type representations of activity-related cues (location, proximity an ...

2008

Discovering Human Routines from Cell Phone Data with Topic Models

Daniel Gatica-Perez, Katayoun Farrahi

IDIAP2008

Object Category Detection using Audio-visual Cues

Barbara Caputo, Jie Luo

IDIAP2007