Publication

Audio-driven Nonlinear Video Diffusion

Related publications (32)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Blind Audio-Visual Source Separation Using Sparse Redundant Representations

Pierre Vandergheynst, Gianluca Monaci, Anna Llagostera Casanovas

This report presents a new method to confront the Blind Audio Source Separation (BASS) problem, by means of audio and visual information. In a given mixture, we are able to locate the video sources first and, posteriorly, recover each source signal, only w ...

2006

Multimodal Speaker Localization in a Probabilistic Framework

Jean-Philippe Thiran, Mihai Gurban

A multimodal probabilistic framework is proposed for the problem of finding the active speaker in a video sequence. We localize the current speaker's mouth in the image by using the video and the audio channels together. We propose a novel visual feature t ...

IEEE2006

Toward sparse and geometry adapted video approximations

Oscar Divorra Escoda

Video signals are sequences of natural images, where images are often modeled as piecewise-smooth signals. Hence, video can be seen as a 3D piecewise-smooth signal made of piecewise-smooth regions that move through time. Based on the piecewise-smooth model ...

EPFL2005

Adaptive video delivery using semantics

Olivier Steiger

The diffusion of network appliances such as cellular phones, personal digital assistants and hand-held computers has created the need to personalize the way media content is delivered to the end user. Moreover, recent devices, such as digital radio receive ...

EPFL2005

Multiresolution Segmentation of Natural Images: From linear to Non-Linear Scale-Space Representations

Pierre Vandergheynst, Oscar Divorra Escoda

In this paper, we introduce a framework that merges classical ideas borrowed from scale-space and multi-resolution segmentation with non-linear partial differential equations. A non-linear scale-space stack is constructed by means of an appropriate diffusi ...

2004

Perceptual Prefiltering for Video Coding

Touradj Ebrahimi

Semantic segmentation is generally associated with second generation video coders, or object-based coders. Object-based coders encode different video objects separately in order to achieve lower bitrates and to enable object-based functionalities. In this ...

IEEE2004

Modeling of 2D+1 texture movies for video coding

Julien Reichel, Gloria Menegaz

We propose a novel model-based coding system for video. Model-based coding aims at improving compression gain by replacing the non-informative image elements with some perceptually equivalent models. Images enclosing large textured regions are ideal candid ...

2003

Progressive mesh-based coding of arbitrary-shaped video objects

Touradj Ebrahimi, Murat Kunt

While the emerging MPEG-4 standard has raised the need for efficient object-based compression, the future MPEG-7 standard motivates further research in the field of progressive, quality-scalable, and semantic representations for indexing and retrieval appl ...

1998

Three-Dimensional Motion Estimation of Objects for Video Coding

Luciano Sbaiz

Three-dimensional (3-D) motion estimation is applied to the problem of motion compensation for video coding. We suppose that the video sequence consists of the perspective projections of a collection of rigid bodies which undergo a rototranslational motion ...

1998

Contour simplification and motion compensated coding

Murat Kunt

In order to solve the bottleneck problem of region-based coding scheme, which is the contour coding, lossy methods are introduced in this paper for both 2D and 3D (2D plus time) contour image(s). A non-linear filter by means of majority operation is design ...

1995