Delving into Sequential Patches for Deepfake Detection

07/06/2022
by Jiazhi Guan, et al.

Recent advances in face forgery techniques produce deepfake videos that are nearly visually untraceable and could be exploited with malicious intent. As a result, researchers have devoted considerable effort to deepfake detection. Previous studies have identified the importance of local low-level cues and temporal information for generalizing well across deepfake methods; however, they still lack robustness against post-processing. In this work, we propose the Local- & Temporal-aware Transformer-based Deepfake Detection (LTTD) framework, which adopts a local-to-global learning protocol with a particular focus on the valuable temporal information within local sequences. Specifically, we propose a Local Sequence Transformer (LST), which models temporal consistency on sequences of restricted spatial regions, where low-level information is hierarchically enhanced with shallow layers of learned 3D filters. Based on the local temporal embeddings, we then achieve the final classification in a global contrastive way. Extensive experiments on popular datasets validate that our approach effectively spots local forgery cues and achieves state-of-the-art performance.
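
The phrase "sequences of restricted spatial regions" can be made concrete with a small sketch. The snippet below is not the authors' implementation; it only illustrates, under assumed conventions, how a video clip might be regrouped so that each spatial location yields its own temporal sequence of patches, which is the kind of input a local sequence model would consume. The function name `local_patch_sequences`, the `(T, H, W, C)` layout, and the patch size are all hypothetical choices made for illustration.

```python
import numpy as np


def local_patch_sequences(clip: np.ndarray, patch: int) -> np.ndarray:
    """Regroup a clip of shape (T, H, W, C) into per-location patch sequences.

    Hypothetical helper for illustration only. Returns an array of shape
    (N, T, patch, patch, C) with N = (H // patch) * (W // patch); entry n
    holds the same spatial region across all T frames.
    """
    t, h, w, c = clip.shape
    if h % patch or w % patch:
        raise ValueError("H and W must be divisible by the patch size")
    gh, gw = h // patch, w // patch
    # Split both spatial axes into (grid index, offset within patch),
    # then move the two grid axes to the front.
    x = clip.reshape(t, gh, patch, gw, patch, c).transpose(1, 3, 0, 2, 4, 5)
    # Flatten the grid in row-major order: n = row * gw + col.
    return x.reshape(gh * gw, t, patch, patch, c)


# Toy clip: 4 frames of 8x8 RGB, split into a 2x2 grid of 4x4 patches.
clip = np.arange(4 * 8 * 8 * 3, dtype=np.float32).reshape(4, 8, 8, 3)
seqs = local_patch_sequences(clip, patch=4)
print(seqs.shape)
```

Each of the resulting sequences covers one fixed region over time, so a temporal model applied per sequence sees only local evidence; how the paper's LST and 3D filters actually process such sequences is described in the full text, not here.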

