Spatiotemporal Inconsistency Learning for DeepFake Video Detection

09/04/2021
by   Zhihao Gu, et al.
23

The rapid development of facial manipulation techniques has aroused public concerns in recent years. Following the success of deep learning, existing methods always formulate DeepFake video detection as a binary classification problem and develop frame-based and video-based solutions. However, little attention has been paid to capturing the spatial-temporal inconsistency in forged videos. To address this issue, we term this task as a Spatial-Temporal Inconsistency Learning (STIL) process and instantiate it into a novel STIL block, which consists of a Spatial Inconsistency Module (SIM), a Temporal Inconsistency Module (TIM), and an Information Supplement Module (ISM). Specifically, we present a novel temporal modeling paradigm in TIM by exploiting the temporal difference over adjacent frames along with both horizontal and vertical directions. And the ISM simultaneously utilizes the spatial information from SIM and temporal information from TIM to establish a more comprehensive spatial-temporal representation. Moreover, our STIL block is flexible and could be plugged into existing 2D CNNs. Extensive experiments and visualizations are presented to demonstrate the effectiveness of our method against the state-of-the-art competitors.

READ FULL TEXT

page 1

page 3

page 7

page 8

research
07/05/2022

Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in VIS and NIR Scenario

In recent years, with the rapid development of face editing and generati...
research
07/16/2018

Spatial-Temporal Synergic Residual Learning for Video Person Re-Identification

We tackle the problem of person re-identification in video setting in th...
research
11/06/2018

A `Little Bit' Too Much? High Speed Imaging from Sparse Photon Counts

Recent advances in photographic sensing technologies have made it possib...
research
06/24/2021

Detection of Deepfake Videos Using Long Distance Attention

With the rapid progress of deepfake techniques in recent years, facial v...
research
12/16/2019

Towards Omni-Supervised Face Alignment for Large Scale Unlabeled Videos

In this paper, we propose a spatial-temporal relational reasoning networ...
research
07/01/2022

Motion Compensated Frequency Selective Extrapolation for Error Concealment in Video Coding

Although wireless and IP-based access to video content gives a new degre...
research
07/07/2021

Cross-View Exocentric to Egocentric Video Synthesis

Cross-view video synthesis task seeks to generate video sequences of one...

Please sign up or login with your details

Forgot password? Click here to reset