IML-ViT: Image Manipulation Localization by Vision Transformer

07/27/2023
by Xiaochen Ma, et al.

Advanced image tampering techniques increasingly challenge the trustworthiness of multimedia, which has led to the development of Image Manipulation Localization (IML). But what makes a good IML model? The answer lies in how the model captures artifacts. Exploiting artifacts requires the model to extract non-semantic discrepancies between manipulated and authentic regions, which means comparing the two regions explicitly. With its self-attention mechanism, the Transformer is naturally the best candidate. In addition, artifacts are sensitive to image resolution, amplified under multi-scale features, and abundant at the manipulation border. We therefore answer the question above by building a ViT with high-resolution capacity, multi-scale feature extraction capability, and manipulation edge supervision. We term this simple but effective ViT paradigm IML-ViT, which has great potential to become a new benchmark for IML. Extensive experiments on five benchmark datasets verify that our model outperforms state-of-the-art manipulation localization methods. Code and models are available at <https://github.com/SunnyHaze/IML-ViT>
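
To make the idea of manipulation edge supervision concrete, the following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes the edge region is the band obtained by subtracting an eroded copy of the ground-truth mask from a dilated copy, and that the training loss adds a binary cross-entropy term restricted to that band. The function names (`edge_band`, `edge_supervised_loss`), the band radius, and the weight `lam` are all hypothetical; the abstract does not specify any of them.

```python
import numpy as np

def dilate(mask, radius=1):
    """Binary dilation with a square (2r+1) x (2r+1) window."""
    h, w = mask.shape
    padded = np.pad(mask, radius, mode="constant", constant_values=0)
    out = np.zeros_like(mask)
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            out |= padded[dy:dy + h, dx:dx + w]
    return out

def erode(mask, radius=1):
    """Binary erosion; padding with ones keeps the image border from acting as an edge."""
    h, w = mask.shape
    padded = np.pad(mask, radius, mode="constant", constant_values=1)
    out = np.ones_like(mask)
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            out &= padded[dy:dy + h, dx:dx + w]
    return out

def edge_band(mask, radius=1):
    """Assumed edge region: pixels within `radius` of the manipulation border."""
    mask = mask.astype(bool)
    return dilate(mask, radius) & ~erode(mask, radius)

def bce(prob, target, weight=None, eps=1e-7):
    """Binary cross-entropy, optionally averaged only over weighted pixels."""
    prob = np.clip(prob, eps, 1.0 - eps)
    loss = -(target * np.log(prob) + (1.0 - target) * np.log(1.0 - prob))
    if weight is None:
        return loss.mean()
    return (loss * weight).sum() / max(weight.sum(), 1.0)

def edge_supervised_loss(prob, mask, radius=1, lam=1.0):
    """Full-mask loss plus an extra term on the edge band (lam is a hypothetical weight)."""
    target = mask.astype(np.float64)
    band = edge_band(mask, radius).astype(np.float64)
    return bce(prob, target) + lam * bce(prob, target, weight=band)
```

Under these assumptions, an error on a border pixel is counted twice, once in the full-mask term and once in the band term, so the model is pushed harder to get the manipulation boundary right.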


research
11/06/2022

MSMG-Net: Multi-scale Multi-grained Supervised Metworks for Multi-task Image Manipulation Detection and Localization

With the rapid advances of image editing techniques in recent years, ima...
research
07/02/2022

Noise and Edge Based Dual Branch Image Manipulation Detection

Unlike ordinary computer vision tasks that focus more on the semantic co...
research
12/21/2022

TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization

In this paper we present TruFor, a forensic framework that can be applie...
research
04/20/2021

M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection

The widespread dissemination of forged images generated by Deepfake tech...
research
10/04/2022

CFL-Net: Image Forgery Localization Using Contrastive Learning

Conventional forgery localizing methods usually rely on different forger...
research
06/30/2023

Act3D: Infinite Resolution Action Detection Transformer for Robotic Manipulation

3D perceptual representations are well suited for robot manipulation as ...
research
09/17/2023

Effective Image Tampering Localization via Enhanced Transformer and Co-attention Fusion

Powerful manipulation techniques have made digital image forgeries be ea...
