The Imitation Game: Detecting Human and AI-Generated Texts in the Era of Large Language Models

07/22/2023
by   Kadhim Hayawi, et al.

Artificial intelligence (AI)-based large language models (LLMs) hold considerable promise for revolutionizing education, research, and practice. However, distinguishing between human-written and AI-generated text has become a significant challenge. This paper presents a comparative study, introducing a novel dataset of human-written and LLM-generated texts in different genres: essays, stories, poetry, and Python code. We employ several machine learning models to classify the texts. Results demonstrate the efficacy of these models in discerning between human and AI-generated text, despite the dataset's limited sample size. However, the task becomes more challenging when classifying GPT-generated text, particularly in story writing. The results indicate that the models perform better in binary classification tasks, such as distinguishing human-generated text from that of a specific LLM, than in the more complex multiclass tasks that involve discerning among human-generated text and text from multiple LLMs. Our findings provide insightful implications for AI text detection, while our dataset paves the way for future research in this evolving area.

research
09/14/2023

Generative AI Text Classification using Ensemble LLM Approaches

Large Language Models (LLMs) have shown impressive performance across a ...
research
05/13/2023

GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content

This paper presents a novel approach for detecting ChatGPT-generated vs....
research
09/14/2023

Detecting ChatGPT: A Survey of the State of Detecting ChatGPT-Generated Text

While recent advancements in the capabilities and widespread accessibili...
research
04/10/2023

On the Possibilities of AI-Generated Text Detection

Our work focuses on the challenge of detecting outputs generated by Larg...
research
04/11/2023

Distinguishing ChatGPT(-3.5, -4)-generated and human-written papers through Japanese stylometric analysis

Text-generative artificial intelligence (AI), including ChatGPT, equippe...
research
07/23/2023

Towards Automatic Boundary Detection for Human-AI Hybrid Essay in Education

Human-AI collaborative writing has been greatly facilitated with the hel...
research
03/10/2023

ChatGPT as the Transportation Equity Information Source for Scientific Writing

Transportation equity is an interdisciplinary agenda that requires both ...
