Towards Corpus-Scale Discovery of Selection Biases in News Coverage: Comparing What Sources Say About Entities as a Start

04/06/2023
by   Sihao Chen, et al.
0

News sources undergo the process of selecting newsworthy information when covering a certain topic. The process inevitably exhibits selection biases, i.e. news sources' typical patterns of choosing what information to include in news coverage, due to their agenda differences. To understand the magnitude and implications of selection biases, one must first discover (1) on what topics do sources typically have diverging definitions of "newsworthy" information, and (2) do the content selection patterns correlate with certain attributes of the news sources, e.g. ideological leaning, etc. The goal of the paper is to investigate and discuss the challenges of building scalable NLP systems for discovering patterns of media selection biases directly from news content in massive-scale news corpora, without relying on labeled data. To facilitate research in this domain, we propose and study a conceptual framework, where we compare how sources typically mention certain controversial entities, and use such as indicators for the sources' content selection preferences. We empirically show the capabilities of the framework through a case study on NELA-2020, a corpus of 1.8M news articles in English from 519 news sources worldwide. We demonstrate an unsupervised representation learning method to capture the selection preferences for how sources typically mention controversial entities. Our experiments show that that distributional divergence of such representations, when studied collectively across entities and news sources, serve as good indicators for an individual source's ideological leaning. We hope our findings will provide insights for future research on media selection biases.

READ FULL TEXT
research
04/16/2019

Selection Bias in News Coverage: Learning it, Fighting it

News entities must select and filter the coverage they broadcast through...
research
01/14/2023

Unveiling the Hidden Agenda: Biases in News Reporting and Consumption

One of the most pressing challenges in the digital media landscape is un...
research
09/11/2021

To Protect and To Serve? Analyzing Entity-Centric Framing of Police Violence

Framing has significant but subtle effects on public opinion and policy....
research
08/29/2018

Analyze Unstructured Data Patterns for Conceptual Representation

Online news media provides aggregated news and stories from different so...
research
04/20/2021

Hidden Biases in Unreliable News Detection Datasets

Automatic unreliable news detection is a research problem with great pot...
research
05/16/2022

SciLander: Mapping the Scientific News Landscape

The COVID-19 pandemic has fueled the spread of misinformation on social ...
research
08/05/2021

Designing Transparency Cues in Online News Platforms to Promote Trust: Journalists' Consumers' Perspectives

As news organizations embrace transparency practices on their websites t...

Please sign up or login with your details

Forgot password? Click here to reset