Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Tracking moving objects is a critical step for smart video surveillance systems. Despite the complexity increase, multiple camera systems exhibit the undoubted advantages of covering wide areas and handling the occurrence of occlusions by exploiting the di ...
Stereo vision is a usual method to obtain depth information from images. The problems encountered when applying well established algorithms to real-time applications are due to the high computational load required. In this article we address this issue by ...
In this paper, we consider the problem of speaker verification as a two-class object detection problem in computer vision, where the object instances are 1-D short-time spectral vectors obtained from the speech signal. More precisely, we investigate the ge ...
Stereo vision is a usual method to obtain depth information from images. The problems encountered when applying the majority of well established algorithms to provide this information are due to the high computational load required. This occurs in both the ...
This paper addresses the problem of distributed image coding in camera neworks. The correlation between multiple images of a scene captured from different viewpoints can be effiiciently modeled by local geometric transforms of prominent images features. Su ...
With the increasing demand of information for more immersive applications such as Google Street view or 3D movies, the efficient analysis of visual data from cameras has gained more importance. This visual information permits to extract some crucial inform ...
Humans perceive their surrounding environment in a multimodal manner by using multi-sensory inputs combined in a coordinated way. Various studies in psychology and cognitive science indicate the multimodal nature of human speech production and perception. ...
This paper proposes to apply the non parametric atlas registration framework we have recently developed in [6]. This technique derived from the optical flow model and the active contour framework allows to base the registration of an anatomical atlas on se ...
In this work, we propose new ways to employ 3D-ranging systems in advanced human-computer interfaces and show that they can be used for precise hand-tracking systems aiming at virtual keyboard or mouse applications. We first implement a Structured Light (S ...
In this paper, we consider the problem of speaker verification as a two-class object detection problem in computer vision, but the object instances are 1-D short-time spectral vectors obtained from the speech signal. More precisely, we investigate the gene ...