Publication

Automatic Temporal Alignment of AV Data with Confidence Estimation

Philip Neil Garner, John David Scott Dines, Danil Korchagin
2010
Article de conférence

Résumé

In this paper, we propose a new approach for the automatic audio-based temporal alignment with confidence estimation of audio-visual data, recorded by different cameras, camcorders or mobile phones during social events. All recorded data is temporally aligned based on ASR-related features with a common master track, recorded by a reference camera, and the corresponding confidence of alignment is estimated. The core of the algorithm is based on perceptual time-frequency analysis with a precision of 10 ms. The results show correct alignment in 99% of cases for a real life dataset and surpass the performance of cross correlation while keeping lower system requirements.

Source officielle

https://infoscience.epfl.ch/record/146092?ln=fr

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Automatic Temporal Alignment of AV Data with Confidence Estimation

Graph Chatbot

Chattez avec Graph Search

Towards a multiscale point cloud structural similarity metric

Database Alignment with Gaussian Features

Rayleigh-Based Distributed Optical Fiber Sensing Using Least Mean Square Similarity

Towards a multiscale point cloud structural similarity metric

Rayleigh-Based Distributed Optical Fiber Sensing Using Least Mean Square Similarity

Database Alignment with Gaussian Features