Publication

Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media