Blind Audiovisual Separation based on Redundant Representations

In this work we present a method that performs complete audiovisual source separation without prior information. The method is based on the assumption that sounds are caused by moving structures. An efficient representation of the audio and video sequences therefore makes it possible to build relationships between synchronous structures in both modalities. A robust clustering algorithm groups video structures that exhibit strong correlations with the audio, so that sources are counted and located in the image. Using this information and exploiting audio-video correlation, the activity of each audio source is determined. Next, spectral GMMs are learnt in time slots where only one source is active, so that the sources can be separated when they are mixed. Audio source separation performance is rigorously evaluated, showing that the proposed algorithm performs efficiently and robustly.
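The activity-detection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the sources have already been located, that each one is summarised by a per-frame motion magnitude, and that simple relative thresholds stand in for the paper's audio-video correlation measure. The function names `source_activity` and `single_source_frames` and both threshold parameters are hypothetical.

```python
import numpy as np

def source_activity(audio_energy, motion, audio_thresh=0.1, motion_thresh=0.1):
    """Mark, per frame, which located sources move while sound is present.

    audio_energy: shape (T,), non-negative audio energy per video frame.
    motion: shape (S, T), non-negative motion magnitude of each source.
    Returns a boolean array of shape (S, T).
    """
    audio_energy = np.asarray(audio_energy, dtype=float)
    motion = np.asarray(motion, dtype=float)
    # Normalise by the peak so both thresholds are relative (an assumption,
    # not taken from the paper).
    a = audio_energy / max(audio_energy.max(), 1e-12)
    m = motion / np.maximum(motion.max(axis=1, keepdims=True), 1e-12)
    # A source counts as active when it moves and audio is present.
    return (m > motion_thresh) & (a > audio_thresh)[None, :]

def single_source_frames(active):
    """Keep, for each source, only the frames where it alone is active."""
    only_one = active.sum(axis=0) == 1
    return active & only_one[None, :]
```

Frames returned by `single_source_frames` play the role of the "time slots where only one source is active": in the method described above, these are the segments from which a per-source spectral model would be learnt before separating the mixed segments.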

Acoustical Features as Knee Health Biomarkers: A Critical Analysis

ASAP: a Dataset of Aligned Scores and Performances for Piano Transcription

Inpainting of Long Audio Segments With Similarity Graphs
