Publication

Audio-Visual Object Extraction using Graph Cuts

Related publications (35)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Audio-Visual Fusion

Anna Llagostera Casanovas

The perception that we have about the world is influenced by elements of diverse nature. Indeed humans tend to integrate information coming from different sensory modalities to better understand their environment. Following this observation, scientists hav ...

EPFL2011

Deformable shape models for 2D object segmentation

Robin Thandiackal

Given a set of images showing individual 2D instances of an object class, the goal is to learn object class deformation in 2D for segmentation automatically. Class deformation is modelled by linear combinations of basis shapes. Usually, given segmentation ...

2011

Audio-driven Nonlinear Video Diffusion

Pierre Vandergheynst, Anna Llagostera Casanovas

In this paper we present a novel nonlinear video diffusion approach based on the fusion of information in audio and video channels. Both modalities are efficiently combined into a diffusion coefficient that integrates the basic assumption in this domain, i ...

Institute of Electrical and Electronics Engineers2011

Finding Objects of Interest in Images using Saliency and Superpixels

Radhakrishna Achanta

The ability to automatically find objects of interest in images is useful in the areas of compression, indexing and retrieval, re-targeting, and so on. There are two classes of such algorithms – those that find any object of interest with no prior knowledg ...

EPFL2011

Unsupervised Extraction of Audio-Visual Objects

Pierre Vandergheynst, Anna Llagostera Casanovas

We propose a novel method to automatically detect and extract the video modality of the sound sources that are present in a scene. For this purpose, we first assess the synchrony between the moving objects captured with a video camera and the sounds record ...

Ieee Service Center, 445 Hoes Lane, Po Box 1331, Piscataway, Nj 08855-1331 Usa2011

Distributed audio coding for wireless hearing aids

Martin Vetterli, Olivier Roy

The aim of the invention is to provide inter-channel level differences ICLD related to audio signals for hearing aids. This aim is achieved by a method for computing ICLD from a first and second audio source signals, the first source signal being wired wit ...

2011

Semi-supervised Extraction of Audio-Visual Sources

Patricia Calatayud Martinez

This report presents a semi-supervised method to jointly extract audio-visual sources from a scene. It consist of applying a supervised method to segment the video signal followed by an automatic process to properly separate the audio track. This approach ...

2010

Blind Audio-Visual Source Separation based on Sparse Redundant Representations

Pierre Vandergheynst, Rémi Gribonval, Gianluca Monaci, Anna Llagostera Casanovas

In this paper we propose a novel method which is able to detect and separate audio-visual sources present in a scene. Our method exploits the correlation between the video signal captured with a camera and a synchronously recorded one-microphone audio trac ...

2010

Dense deformation field estimation for atlas registration using the active contour framework

Valérie Duay

A key research area in computer vision is image segmentation. Image segmentation aims at extracting objects of interest in images or video sequences. These objects contain relevant information for a given application. For example, a video surveillance appl ...

EPFL2008

Blind audiovisual source separation using sparse representations

Pierre Vandergheynst, Gianluca Monaci, Anna Llagostera Casanovas

In this work we present a method to jointly separate active audio and visual structures on a given mixture. Blind Audiovisual Source Separation is achieved exploiting the coherence between a video Signal and a one-microphone audio track. The efficient repr ...

Ieee Service Center, 445 Hoes Lane, Po Box 1331, Piscataway, Nj 08855-1331 Usa2007