Efficient Temporally-Aware DeepFake Detection using H.264 Motion Vectors

Video DeepFakes are fake media created with Deep Learning (DL) that manipulate a person’s expression or identity. Most current DeepFake detection methods analyze each frame independently and therefore ignore inconsistencies and unnatural movements between frames. Some newer methods employ optical flow models to capture this temporal aspect, but these are computationally expensive. In contrast, we propose using the related but often ignored Motion Vectors (MVs) and Information Masks (IMs) from the H.264 video codec to detect temporal inconsistencies in DeepFakes. Our experiments show that this approach is effective and incurs minimal computational cost compared with per-frame RGB-only methods. This could lead to new, real-time, temporally aware DeepFake detection methods for video calls and streaming.
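To make the idea of codec motion vectors more concrete, the sketch below shows one way block-level vectors could be expanded into a dense per-pixel map that a detector might consume alongside the RGB frame. It is an illustration under stated assumptions, not the method of the paper: the `BlockMV` container and the `rasterize_mvs` helper are hypothetical names, the vectors are assumed to have already been exported by a decoder, and the binary coverage mask is only a guess at the kind of information an Information Mask might carry.

```python
from typing import List, NamedTuple, Tuple


class BlockMV(NamedTuple):
    """One block-level motion vector (hypothetical container, not a codec API)."""
    x: int      # left edge of the block, in pixels
    y: int      # top edge of the block, in pixels
    w: int      # block width, in pixels
    h: int      # block height, in pixels
    dx: float   # horizontal displacement of the block
    dy: float   # vertical displacement of the block


def rasterize_mvs(
    mvs: List[BlockMV], height: int, width: int
) -> Tuple[List[List[Tuple[float, float]]], List[List[int]]]:
    """Expand block-level vectors into a dense (dx, dy) map plus a coverage mask.

    Pixels not covered by any block keep a zero vector and a mask value of 0.
    Treating this mask as an "information mask" is an assumption of this sketch.
    """
    flow = [[(0.0, 0.0) for _ in range(width)] for _ in range(height)]
    mask = [[0] * width for _ in range(height)]
    for mv in mvs:
        # Clip the block to the frame so partially visible blocks are handled.
        for row in range(max(mv.y, 0), min(mv.y + mv.h, height)):
            for col in range(max(mv.x, 0), min(mv.x + mv.w, width)):
                flow[row][col] = (mv.dx, mv.dy)
                mask[row][col] = 1
    return flow, mask


# Toy example: a single 4x4 block in the top-left corner of an 8x8 frame.
demo_mvs = [BlockMV(x=0, y=0, w=4, h=4, dx=1.0, dy=-2.0)]
demo_flow, demo_mask = rasterize_mvs(demo_mvs, height=8, width=8)
```

Because the vectors are already computed by the encoder, this step only copies values into an array. That is the intuition behind the low overhead compared with running a separate optical flow model.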

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Assessment framework for deepfake detection in real-world situations

Deep Generative Models for Autonomous Driving: from Motion Forecasting to Realistic Image Synthesis
