The main goal of this project is to identify the best confidence estimation method, one whose performance is reliable in comparison to multimodal fusion alone. To do that, three alternative approaches to prediction confidence estimation are presented a ...
Multi-band, multi-stream and multi-modal approaches have proven to be very successful both in experiments and in real-life applications, among which speech recognition and biometric authentication are of particular interest here. However, there is a lack o ...
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition ...
This document presents a review of gestures for multi-modal interfaces and focuses on hand gestures. It first introduces the role that the gesture modality plays in human communication. It then describes different types of gestures. Finally, it gives an over ...
A phosphino,oxazoline P,N-bidentate ligand, 4, contg. 3,5-di-tert-butylphenyl groups was prepd. In the Heck arylation of dihydrofuran, 4 affords higher ee's than either 2 or 3, the unsubstituted and m-dimethylphenyl analogs, resp. Several Pd(0) complexes o ...
In Web-based information commerce it is difficult to disentangle presentation from process logic, and sometimes even data is not separate from the presentation. Consequently, it becomes crucial to define an abstract model for business processes and their ma ...
This paper considers design methodologies in order to develop voice-enabled interfaces for tour-guide robots deployed at the Robotics Exposition of the Swiss National Exhibition (Expo.02). Human–robot voice communication presents new challenges for design ...
In this article we review several successful extensions to the standard Hidden-Markov-Model/Artificial Neural Network (HMM/ANN) hybrid, which have recently made important contributions to the field of noise robust automatic speech recognition. The first ex ...
This paper considers design methodologies in order to develop voice-enabled interfaces for tour-guide robots to be deployed at the Robotics Exposition of the Swiss National Exhibition (Expo.02). Human-robot voice communication presents new challenges for d ...