Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks

06/13/2023
by   Veniamin Veselovsky, et al.
0

Large language models (LLMs) are remarkable data annotators. They can be used to generate high-fidelity supervised training data, as well as survey and experimental data. With the widespread adoption of LLMs, human gold–standard annotations are key to understanding the capabilities of LLMs and the validity of their results. However, crowdsourcing, an important, inexpensive way to obtain human annotations, may itself be impacted by LLMs, as crowd workers have financial incentives to use LLMs to increase their productivity and income. To investigate this concern, we conducted a case study on the prevalence of LLM usage by crowd workers. We reran an abstract summarization task from the literature on Amazon Mechanical Turk and, through a combination of keystroke detection and synthetic text classification, estimate that 33-46 workers used LLMs when completing the task. Although generalization to other, less LLM-friendly tasks is unclear, our results call for platforms, researchers, and crowd workers to find new ways to ensure that human data remain human, perhaps using the methodology proposed here as a stepping stone. Code/data: https://github.com/epfl-dlab/GPTurk

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/27/2023

ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Many NLP applications require manual data annotations for a variety of t...
research
12/20/2022

Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization

The acquisition of high-quality human annotations through crowdsourcing ...
research
07/05/2023

Power-up! What Can Generative Models Do for Human Computation Workflows?

We are amidst an explosion of artificial intelligence research, particul...
research
05/22/2023

ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness

The emergence of generative large language models (LLMs) raises the ques...
research
08/14/2023

Detecting The Corruption Of Online Questionnaires By Artificial Intelligence

Online questionnaires that use crowd-sourcing platforms to recruit parti...
research
11/30/2019

Fooling the Crowd with Deep Learning-based Methods

Modern, state-of-the-art deep learning approaches yield human like perfo...
research
11/20/2020

Crowdsourcing Airway Annotations in Chest Computed Tomography Images

Measuring airways in chest computed tomography (CT) scans is important f...

Please sign up or login with your details

Forgot password? Click here to reset