Associating Audio-Visual Activity Cues in a Dominance Estimation Framework
Related publications (102)
In this paper, we propose a new approach for the automatic audio-based out-of-scene detection of audio-visual data, recorded by different cameras, camcorders or mobile phones during social events. All recorded data is clustered to out-of-scene and in-scene ...
In this paper, we propose a new concept for how tempo-social information about moments of togetherness within a social group of people can be retrieved in the palm of the hand from social context. The social context is digitised by audio logging of the same us ...
Correlation-based alignment is an alternative alignment method for electron beam lithography. Using complex marker patterns, such as Penrose patterns, which contain more positional information, greater alignment accuracy can be achieved. Correlation-based ...
A method to synchronize impulse radio signal in a receiver based on a cross-correlation between an input signal and a template pulse train is described. The method comprises the steps of receiving a radio signal, performing a correlation between the acquir ...
Analytic queueing network models often assume infinite capacity queues due to the difficulty of grasping the between-queue correlation. This correlation can help to explain the propagation of congestion. We present an analytic queueing network model which ...
A novel model is presented to learn bimodally informative structures from audio-visual signals. The signal is represented as a sparse sum of audio-visual kernels. Each kernel is a bimodal function consisting of synchronous snippets of an audio waveform an ...
This paper presents a distributed coding scheme for the representation of 3D scenes captured by omnidirectional cameras. We consider a scenario with a pair of similar cameras that benefit from equivalent bandwidth and computational resources. The images ar ...
We address the problem of both estimating the dominant person in a meeting from a single audio source and identifying them visually in a multi-camera setting. We use a speaker diarization algorithm to perform speaker segmentation and clustering, representi ...