Large Language Models can be Guided to Evade AI-Generated Text Detection

05/18/2023
by   Ning Lu, et al.
0

Large Language Models (LLMs) have demonstrated exceptional performance in a variety of tasks, including essay writing and question answering. However, it is crucial to address the potential misuse of these models, which can lead to detrimental outcomes such as plagiarism and spamming. Recently, several detectors have been proposed, including fine-tuned classifiers and various statistical methods. In this study, we reveal that with the aid of carefully crafted prompts, LLMs can effectively evade these detection systems. We propose a novel Substitution-based In-Context example Optimization method (SICO) to automatically generate such prompts. On three real-world tasks where LLMs can be misused, SICO successfully enables ChatGPT to evade six existing detectors, causing a significant 0.54 AUC drop on average. Surprisingly, in most cases these detectors perform even worse than random classifiers. These results firmly reveal the vulnerability of existing detectors. Finally, the strong performance of SICO suggests itself as a reliable evaluation protocol for any new detector in this field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/23/2023

Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense

To detect the deployment of large language models for malicious use case...
research
05/31/2023

Red Teaming Language Model Detectors with Language Models

The prevalence and high capacity of large language models (LLMs) present...
research
09/14/2022

PainPoints: A Framework for Language-based Detection of Chronic Pain and Expert-Collaborative Text-Summarization

Chronic pain is a pervasive disorder which is often very disabling and i...
research
02/19/2020

Attacking Neural Text Detectors

Machine learning based language models have recently made significant pr...
research
04/18/2023

Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs

The self-attention revolution allowed generative language models to scal...
research
03/13/2023

Vision-Language Models as Success Detectors

Detecting successful behaviour is crucial for training intelligent agent...
research
07/05/2023

Evade ChatGPT Detectors via A Single Space

ChatGPT brings revolutionary social value but also raises concerns about...

Please sign up or login with your details

Forgot password? Click here to reset