GPT Paternity Test: GPT Generated Text Detection with GPT Genetic Inheritance

05/21/2023
by Xiao Yu, et al.

Large Language Models (LLMs) can generate texts that carry the risk of various misuses, including plagiarism, planting fake reviews on e-commerce platforms, or creating fake social media postings that can sway election results. Detecting whether a text is machine-generated has thus become increasingly important. While machine-learning-based detection strategies exhibit superior performance, they often lack generalizability, limiting their practicality. In this work, we introduce GPT Paternity Test (GPT-Pat), which reliably detects machine-generated text across varied datasets. Given a text under scrutiny, we leverage ChatGPT to generate a corresponding question and then provide a re-answer to that question. By measuring the similarity between the original text and the re-answered text, we determine whether the original text is machine-generated. GPT-Pat consists of a Siamese network, which computes the similarity between the original text and the re-answered text, and a binary classifier. Our method achieved an average accuracy of 94.57% on generalization test sets, surpassing the state-of-the-art RoBERTa-based method by 12.34%. It also degrades less than the RoBERTa-based method when it is attacked by re-translation and polishing.
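The question, re-answer, and compare pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The names `paternity_test`, `ask_question`, `re_answer`, and `threshold` are hypothetical. The two callables stand in for ChatGPT calls, a bag-of-words cosine similarity is swapped in for the Siamese network, and a fixed threshold is swapped in for the trained binary classifier.

```python
import math
import re
from collections import Counter
from typing import Callable, Tuple


def bow_cosine(a: str, b: str) -> float:
    """Cosine similarity of word-count vectors.

    Assumption: a simple stand-in for the Siamese network in the abstract.
    """
    ca = Counter(re.findall(r"\w+", a.lower()))
    cb = Counter(re.findall(r"\w+", b.lower()))
    dot = sum(ca[w] * cb[w] for w in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def paternity_test(
    text: str,
    ask_question: Callable[[str], str],  # hypothetical: LLM writes a question the text answers
    re_answer: Callable[[str], str],     # hypothetical: LLM answers that question afresh
    threshold: float = 0.7,              # assumption: replaces the trained binary classifier
) -> Tuple[bool, float]:
    """Return (flagged_as_machine_generated, similarity_score)."""
    question = ask_question(text)
    regenerated = re_answer(question)
    score = bow_cosine(text, regenerated)
    # High similarity to the model's own re-answer suggests machine authorship.
    return score >= threshold, score
```

In use, the two callables would wrap whatever LLM client is available. A text that the model reproduces almost verbatim scores near 1.0, while text sharing few words with the re-answer scores near 0.0. The 0.7 threshold is arbitrary and would need tuning on labeled data.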


research
02/10/2023

Combat AI With AI: Counteract Machine-Generated Fake Restaurant Reviews on Social Media

Recent advances in generative models such as GPT may be used to fabricat...
research
10/06/2020

RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text

In recent years, large neural networks for natural language generation (...
research
09/28/2020

Transformers Are Better Than Humans at Identifying Generated Text

Fake information spread via the internet and social media influences pub...
research
12/19/2019

Identifying Adversarial Sentences by Analyzing Text Complexity

Attackers create adversarial text to deceive both human perception and t...
research
01/08/2023

Mitigating Human and Computer Opinion Fraud via Contrastive Learning

We introduce the novel approach towards fake text reviews detection in c...
research
06/07/2019

Real or Fake? Learning to Discriminate Machine from Human Generated Text

Recent advances in generative modeling of text have demonstrated remarka...
research
05/14/2023

Watermarking Text Generated by Black-Box Language Models

LLMs now exhibit human-like skills in various fields, leading to worries...
