Multi-pose lipreading and audio-visual speech recognition
Related publications (71)
We studied the spatiotemporal characteristics of cortical activity in early visual areas and the fusiform gyri (FG) by means of magnetoencephalography (MEG). Subjects performed a visual classification task, in which letters and visually similar pseudolette ...
Vision is dynamic. After their onset, visual stimuli undergo a complex cascade of processes before awareness is reached. Even after more than a century of research, the timing of these processes is still largely unknown. In particular, how the brain determ ...
We present a method for dynamically integrating audio-visual information for speech recognition, based on the estimated reliability of the audio and visual streams. Our method uses an information theoretic measure, the entropy derived from the state probab ...
The human brain analyzes a visual object first by basic feature detectors. On the object's way to a conscious percept, these features are integrated in subsequent stages of the visual hierarchy. The time course of this feature integration is largely unknown ...
Association for Research in Vision and Ophthalmology, 2009
To cope with the continuously incoming stream of input, the visual system has to group information across space and time. Usually, spatial and temporal grouping are investigated separately. However, recent findings revealed that these two grouping mechanis ...
Association for Research in Vision and Ophthalmology, 2010
A quantitative measure of relevance is proposed for the task of constructing visual feature sets which are at the same time relevant and compact. A feature's relevance is given by the amount of information that it contains about the problem, while compactn ...
The primate visual system is organized into two parallel anatomical pathways, both originating in early visual areas but terminating in posterior parietal or inferior temporal regions. Classically, these two pathways have been thought to subserve spatial v ...
Person identification using audio or visual biometrics is a well-studied problem in pattern recognition. In this scenario, both training and testing are done on the same modalities. However, there can be situations where this condition is not valid, i.e. t ...
Based on 6 years of continuous measurements, we have analysed in detail the occupancy, thermal and visual parameters influencing actions on shading devices in order to derive an accurate model for the prediction of their usage in office buildings. This art ...