NELA-GT-2021: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

03/10/2022
by Mauricio Gruppi, et al.

In this paper, we present the fourth installment of the NELA-GT datasets, NELA-GT-2021. The dataset contains 1.8M articles from 367 outlets, published between January 1st, 2021 and December 31st, 2021. As in past releases of the dataset, NELA-GT-2021 includes outlet-level veracity labels from Media Bias/Fact Check and tweets embedded in the collected news articles. The NELA-GT-2021 dataset can be found at: https://doi.org/10.7910/DVN/RBKVBM

research
02/08/2021

NELA-GT-2020: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

In this paper, we present an updated version of the NELA-GT-2019 dataset...
research
12/22/2022

MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification

This article presents a dataset of 10,917 news articles with hierarchica...
research
08/06/2019

DpgMedia2019: A Dutch News Dataset for Partisanship Detection

We present a new Dutch news dataset with labeled partisanship. The datas...
research
01/29/2020

HoaxItaly: a collection of Italian disinformation and fact-checking stories shared on Twitter in 2019

We released over 1 million tweets shared during 2019 and containing link...
research
10/07/2022

Quantifying Political Bias in News Articles

Search bias analysis is getting more attention in recent years since sea...
research
12/17/2022

'If you build they will come': Automatic Identification of News-Stakeholders to detect Party Preference in News Coverage

The coverage of different stakeholders mentioned in the news articles si...
research
08/07/2023

Measuring Variety, Balance, and Disparity: An Analysis of Media Coverage of the 2021 German Federal Election

Determining and measuring diversity in news articles is important for a ...
