Publication

Automatic Temporal Alignment of AV Data with Confidence Estimation

Philip Neil Garner, John David Scott Dines, Danil Korchagin
2010
Conference paper

Abstract

In this paper, we propose a new approach for the automatic audio-based temporal alignment with confidence estimation of audio-visual data, recorded by different cameras, camcorders or mobile phones during social events. All recorded data is temporally aligned based on ASR-related features with a common master track, recorded by a reference camera, and the corresponding confidence of alignment is estimated. The core of the algorithm is based on perceptual time-frequency analysis with a precision of 10 ms. The results show correct alignment in 99% of cases for a real life dataset and surpass the performance of cross correlation while keeping lower system requirements.

Official source

https://infoscience.epfl.ch/record/146092?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Automatic Temporal Alignment of AV Data with Confidence Estimation

Graph Chatbot

Chat with Graph Search

Towards a multiscale point cloud structural similarity metric

Database Alignment with Gaussian Features

Rayleigh-Based Distributed Optical Fiber Sensing Using Least Mean Square Similarity

Rayleigh-Based Distributed Optical Fiber Sensing Using Least Mean Square Similarity

Database Alignment with Gaussian Features

Towards a multiscale point cloud structural similarity metric