Hybrid Transformer Network for Deepfake Detection

08/11/2022
by Sohail Ahmed Khan, et al.

Deepfake media is becoming widespread because of easily available tools and mobile apps that can generate realistic-looking deepfake videos and images without requiring any technical knowledge. As the technology advances, both the quantity and the quality of deepfake media are expected to grow, making deepfakes a likely practical tool for spreading mis/disinformation. Because of these concerns, deepfake detection tools are becoming a necessity. In this study, we propose a novel hybrid transformer network that uses an early feature fusion strategy for deepfake video detection. Our model employs two different CNNs, (1) XceptionNet and (2) EfficientNet-B4, as feature extractors. We train both feature extractors together with the transformer end to end on the FaceForensics++ and DFDC benchmarks. Despite its relatively straightforward architecture, our model achieves results comparable to more advanced state-of-the-art approaches when evaluated on FaceForensics++ and DFDC. We also propose novel face cut-out augmentations, as well as random cut-out augmentations, and show that they improve the detection performance of our model and reduce overfitting. In addition, we show that our model is capable of learning from a considerably small amount of data.
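
To make the random cut-out idea concrete, the following is a minimal sketch of a generic random cut-out augmentation, in which one randomly placed rectangle of a frame is overwritten with a constant value. The abstract does not specify the authors' procedure, so the function name `random_cutout`, the `max_frac` and `fill` parameters, and the list-of-rows image representation are all assumptions made for illustration. The face cut-out variant is not reproduced here, since its details are not given in this text.

```python
import random


def random_cutout(image, max_frac=0.5, fill=0, rng=None):
    """Return a copy of `image` with one random rectangle set to `fill`.

    `image` is a list of rows of pixel values (hypothetical representation).
    `max_frac` bounds the rectangle size as a fraction of each dimension.
    This is a generic cut-out sketch, not the paper's exact augmentation.
    """
    if rng is None:
        rng = random.Random()
    height, width = len(image), len(image[0])
    # Rectangle size: at least 1 pixel, at most max_frac of each dimension.
    cut_h = rng.randint(1, max(1, int(height * max_frac)))
    cut_w = rng.randint(1, max(1, int(width * max_frac)))
    # Top-left corner chosen so the rectangle stays inside the image.
    top = rng.randint(0, height - cut_h)
    left = rng.randint(0, width - cut_w)
    # Copy the rows so the input image is left untouched.
    out = [row[:] for row in image]
    for y in range(top, top + cut_h):
        for x in range(left, left + cut_w):
            out[y][x] = fill
    return out


if __name__ == "__main__":
    frame = [[1] * 8 for _ in range(6)]
    for row in random_cutout(frame, rng=random.Random(0)):
        print(row)
```

Applied independently to each training frame, an augmentation of this kind hides part of the input so that a detector cannot rely on a single image region, which is one common motivation for using cut-out to reduce overfitting.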


