Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites

05/16/2023
by Hans W. A. Hanley, et al.

With the increasing popularity of generative large language models (LLMs) like ChatGPT, a growing number of news websites have begun using them to generate articles. However, not only can these language models produce factually inaccurate articles on reputable websites, but disreputable news sites can also use them to mass-produce misinformation. To begin to understand this phenomenon, we present one of the first large-scale studies of the prevalence of synthetic articles within online news media. To do this, we train a DeBERTa-based synthetic news detector and classify over 12.91 million articles from 3,074 misinformation and mainstream news websites. We find that between January 1, 2022 and April 1, 2023, the relative number of synthetic news articles increased by 79.4% on mainstream websites and by 342% on misinformation sites. Analyzing the impact of the release of ChatGPT using an interrupted time series, we show that while its release resulted in a marked increase in synthetic articles on small sites as well as misinformation news websites, there was not a corresponding increase on large mainstream news websites. Finally, using data from the social media platform Reddit, we find that social media users interacted more with synthetic articles in March 2023 relative to January 2022.
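As a rough illustration of the interrupted-time-series idea mentioned in the abstract, the sketch below fits a standard segmented regression with a level change and a trend change at an intervention point. It is a minimal sketch under stated assumptions, not the authors' model: the monthly binning, the function name `fit_interrupted_time_series`, and the toy series are all hypothetical, and the toy numbers are constructed for the example rather than taken from the study.

```python
import numpy as np

def fit_interrupted_time_series(y, t0):
    """Fit y_t = b0 + b1*t + b2*D_t + b3*(t - t0)*D_t by least squares.

    D_t is 1 for periods at or after the intervention index t0, else 0.
    Hypothetical helper for illustration; not the specification used in the paper.
    """
    y = np.asarray(y, dtype=float)
    t = np.arange(len(y), dtype=float)
    post = (t >= t0).astype(float)
    # Design matrix: intercept, pre-existing trend, level shift, slope shift.
    X = np.column_stack([np.ones_like(t), t, post, (t - t0) * post])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    names = ["baseline", "pre_trend", "level_change", "trend_change"]
    return dict(zip(names, coef))

# Toy, noise-free series (made-up values, NOT data from the study):
# 15 monthly bins with an intervention at index 11.
months = np.arange(15)
toy_share = 1.0 + 0.1 * months + np.where(months >= 11, 0.5 + 0.3 * (months - 11), 0.0)

params = fit_interrupted_time_series(toy_share, t0=11)
for name, value in params.items():
    print(f"{name}: {value:.3f}")
```

In this formulation, `level_change` captures an immediate jump at the intervention and `trend_change` captures a change in slope afterwards, which is how an effect like the one described for small and misinformation sites would show up. A real analysis would also need to handle noise, autocorrelation, and uncertainty estimates, which this sketch omits.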


