Introducing MBIB – the first Media Bias Identification Benchmark Task and Dataset Collection

04/25/2023
by Martin Wessel, et al.

Although media bias detection is a complex multi-task problem, there is, to date, no unified benchmark grouping these evaluation tasks. We introduce the Media Bias Identification Benchmark (MBIB), a comprehensive benchmark that groups different types of media bias (e.g., linguistic, cognitive, political) under a common framework to test how prospective detection techniques generalize. After reviewing 115 datasets, we select nine tasks and carefully propose 22 associated datasets for evaluating media bias detection techniques. We evaluate MBIB using state-of-the-art Transformer techniques (e.g., T5, BART). Our results suggest that while hate speech, racial bias, and gender bias are easier to detect, models struggle to handle certain bias types, e.g., cognitive and political bias. However, no single technique significantly outperforms all the others. We also find that research interest and resources are unevenly distributed across the individual media bias tasks. A unified benchmark encourages the development of more robust systems and shifts the current paradigm in media bias detection evaluation towards solutions that tackle not one but multiple media bias types simultaneously.

