ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitter Messages with Zero-Shot Learning

04/13/2023
by Petter Törnberg, et al.

This paper assesses the accuracy, reliability and bias of the Large Language Model (LLM) ChatGPT-4 on the text analysis task of classifying the political affiliation of a Twitter poster based on the content of a tweet. The LLM is compared to manual annotation by both expert classifiers and crowd workers, generally considered the gold standard for such tasks. We use Twitter messages from United States politicians during the 2020 election, providing a ground truth against which to measure accuracy. The paper finds that ChatGPT-4 achieves higher accuracy, higher reliability, and equal or lower bias than the human classifiers. The LLM is able to correctly annotate messages that require reasoning on the basis of contextual knowledge and inferences about the author's intentions, which are traditionally seen as uniquely human abilities. These findings suggest that LLMs will have a substantial impact on the use of textual data in the social sciences by enabling interpretive research at scale.
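
As a rough illustration of the three quantities the abstract compares, the sketch below scores one annotator's labels against ground-truth party labels. It is not the paper's method: the function names, the toy labels, the use of simple percent agreement as a stand-in for reliability, and the definition of bias as a skew in predicted party share are all assumptions made for this example, and the paper may use different statistics.

```python
from collections import Counter


def accuracy(predicted, truth):
    """Share of labels that match the ground-truth party."""
    return sum(p == t for p, t in zip(predicted, truth)) / len(truth)


def percent_agreement(run_a, run_b):
    """Share of items given the same label in two annotation runs.

    Used here as a simple, assumed stand-in for a reliability measure.
    """
    return sum(a == b for a, b in zip(run_a, run_b)) / len(run_a)


def party_bias(predicted, truth, party="Democrat"):
    """Predicted share of `party` minus its true share (0 means no skew)."""
    n = len(truth)
    return Counter(predicted)[party] / n - Counter(truth)[party] / n


# Toy data (hypothetical): true party of four tweet authors and two
# annotation runs by the same annotator over the same tweets.
truth = ["Democrat", "Republican", "Democrat", "Republican"]
run_1 = ["Democrat", "Republican", "Republican", "Republican"]
run_2 = ["Democrat", "Republican", "Democrat", "Republican"]

print("accuracy:", accuracy(run_1, truth))
print("agreement between runs:", percent_agreement(run_1, run_2))
print("bias toward Democrat:", party_bias(run_1, truth))
```

The same three functions could be applied to the LLM, the expert coders, and the crowd workers in turn, so that each annotator type is scored on an identical footing.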

research
03/27/2023

ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Many NLP applications require manual data annotations for a variety of t...
research
05/14/2020

Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background

Truthfulness judgments are a fundamental step in the process of fighting...
research
05/23/2023

Diverse Perspectives Can Mitigate Political Bias in Crowdsourced Content Moderation

In recent years, social media companies have grappled with defining and ...
research
08/25/2023

Prompting a Large Language Model to Generate Diverse Motivational Messages: A Comparison with Human-Written Messages

Large language models (LLMs) are increasingly capable and prevalent, and...
research
09/03/2023

How Crowd Worker Factors Influence Subjective Annotations: A Study of Tagging Misogynistic Hate Speech in Tweets

Crowdsourced annotation is vital to both collecting labelled data to tra...
research
04/25/2023

Fairness and Bias in Truth Discovery Algorithms: An Experimental Analysis

Machine learning (ML) based approaches are increasingly being used in a ...
research
05/09/2019

A joint text mining-rank size investigation of the rhetoric structures of the US Presidents' speeches

This work presents a text mining context and its use for a deep analysis...
