Evading Watermark based Detection of AI-Generated Content

05/05/2023
by Zhengyuan Jiang, et al.

Generative AI models, such as DALL-E, Stable Diffusion, and ChatGPT, can generate extremely realistic-looking content, posing growing challenges to the authenticity of information. To address these challenges, watermarking has been leveraged to detect AI-generated content. Specifically, a watermark is embedded into AI-generated content before it is released, and content is detected as AI-generated if a similar watermark can be decoded from it. In this work, we perform a systematic study of the robustness of such watermark-based detection, focusing on AI-generated images. Our work shows that an attacker can post-process a watermarked AI-generated image by adding a small, human-imperceptible perturbation to it, such that the post-processed image evades detection while maintaining its visual quality. We demonstrate the effectiveness of our attack both theoretically and empirically. Moreover, to evade detection, our adversarial post-processing method adds much smaller perturbations to AI-generated images, and thus better maintains their visual quality, than existing popular image post-processing methods such as JPEG compression, Gaussian blur, and brightness/contrast adjustment. Our work demonstrates the insufficiency of existing watermark-based detection of AI-generated content, highlighting the urgent need for new detection methods.
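
To make the detection rule in the abstract concrete, the following is a minimal sketch and not the paper's method. It assumes that "a similar watermark" means the bitwise accuracy between the decoded bits and the embedded bits reaches a threshold `tau`, and it replaces the learned watermark decoder with a hypothetical linear decoder that has orthonormal rows, so that the smallest bit-flipping perturbation can be written in closed form. All names (`make_toy_decoder`, `embed`, `evade`) and all parameter values are illustrative assumptions.

```python
import numpy as np


def bitwise_accuracy(decoded, reference):
    """Fraction of bit positions where the decoded and reference watermarks agree."""
    decoded = np.asarray(decoded, dtype=bool)
    reference = np.asarray(reference, dtype=bool)
    return float(np.mean(decoded == reference))


def is_detected(decoded, reference, tau=0.75):
    """Assumed detection rule: flag as AI-generated if bitwise accuracy >= tau."""
    return bitwise_accuracy(decoded, reference) >= tau


def make_toy_decoder(n_bits, dim, rng):
    """Hypothetical linear decoder: an (n_bits, dim) matrix with orthonormal rows."""
    q, _ = np.linalg.qr(rng.standard_normal((dim, n_bits)))
    return q.T


def decode(weights, image):
    """Bit i is 1 when the image has a positive projection onto decoder row i."""
    return weights @ image > 0


def embed(weights, image, reference, strength=1.0):
    """Toy embedding: overwrite the projection onto each row with +/- strength."""
    signs = np.where(reference, 1.0, -1.0)
    cleaned = image - weights.T @ (weights @ image)
    return cleaned + weights.T @ (strength * signs)


def evade(weights, image, reference, tau=0.75, margin=0.05):
    """Flip just enough matching bits to push bitwise accuracy below tau.

    Because the rows are orthonormal, moving along row i changes only bit i,
    so flipping the bits with the smallest |score| gives a small perturbation.
    """
    reference = np.asarray(reference, dtype=bool)
    scores = weights @ image
    matching = np.flatnonzero((scores > 0) == reference)
    # Largest number of matching bits that still evades detection.
    allowed = int(np.ceil(tau * reference.size)) - 1
    n_flip = max(0, matching.size - allowed)
    cheapest = matching[np.argsort(np.abs(scores[matching]))[:n_flip]]
    # Push each chosen score slightly past zero to the opposite sign.
    delta = -(1.0 + margin) * (weights[cheapest].T @ scores[cheapest])
    return image + delta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    weights = make_toy_decoder(n_bits=32, dim=4096, rng=rng)
    reference = rng.integers(0, 2, size=32).astype(bool)
    watermarked = embed(weights, rng.standard_normal(4096), reference)
    attacked = evade(weights, watermarked, reference)
    print("detected before:", is_detected(decode(weights, watermarked), reference))
    print("detected after: ", is_detected(decode(weights, attacked), reference))
    rel = np.linalg.norm(attacked - watermarked) / np.linalg.norm(watermarked)
    print("relative L2 perturbation:", rel)
```

In this toy setting the attacker needs to flip only as many bits as it takes to fall below `tau`, which is why the perturbation can stay small relative to the image. A real watermark decoder is a neural network, so the closed-form step here would be replaced by an optimization over the perturbation.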


