Related publications (74)

Blind Audio-Visual Source Separation Using Sparse Redundant Representations

Pierre Vandergheynst, Gianluca Monaci, Anna Llagostera Casanovas

This report presents a new method to confront the Blind Audio Source Separation (BASS) problem, by means of audio and visual information. In a given mixture, we are able to locate the video sources first and, posteriorly, recover each source signal, only w ...
2006

Method and system for combining video with spatio-temporal alignment

Martin Vetterli, Paolo Prandoni, Serge Ayer

Given two video sequences (IS1, IS2), a composite video sequence can be generated which includes visual elements from each of the given sequences, suitably synchronized and represented in a chosen focal plane (figure 1). For example, given two video sequen ...
2006

A multimodal approach to extract optimized audio features for speaker detection

Jean-Philippe Thiran, Murat Kunt, Torsten Butz, Patricia Besson

We present a method that exploits an information theoretic framework to extract optimal audio features with respect to the video features. A simple measure of mutual information between the resulting audio features and the video ones allows to detect the a ...
IEEE2005

Semantic Video Analysis for Adaptive Content Delivery and Automatic Description

Touradj Ebrahimi

We present an encoding framework which exploits semantics for video content delivery. The video content is organized based on the idea of main content message. In the work reported in this paper, the main content message is extracted from the video data th ...
2005

Effect of Segmentation Method on Video Retrieval Performance

David Grangier, Alessandro Vinciarelli

This paper presents experiments that evaluate the effect of different video segmentation methods on text-based video retrieval. Segmentations relying on modalities like speech, video and text or their combination are compared with a baseline sliding window ...
2005

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.