NELA-GT-2020: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

02/08/2021
by   Mauricio Gruppi, et al.
0

In this paper, we present an updated version of the NELA-GT-2019 dataset, entitled NELA-GT-2020. NELA-GT-2020 contains nearly 1.8M news articles from 519 sources collected between January 1st, 2020 and December 31st, 2020. Just as with NELA-GT-2018 and NELA-GT-2019, these sources come from a wide range of mainstream news sources and alternative news sources. Included in the dataset are source-level ground truth labels from Media Bias/Fact Check (MBFC) covering multiple dimensions of veracity. Additionally, new in the 2020 dataset are the Tweets embedded in the collected news articles, adding an extra layer of information to the data. The NELA-GT-2020 dataset can be found at https://doi.org/10.7910/DVN/CHMUYZ.

READ FULL TEXT

page 3

page 5

research
03/18/2020

NELA-GT-2019: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

In this paper, we present an updated version of the NELA-GT-2018 dataset...
research
03/10/2022

NELA-GT-2021: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

In this paper, we present the fourth installment of the NELA-GT datasets...
research
03/21/2022

DIANES: A DEI Audit Toolkit for News Sources

Professional news media organizations have always touted the importance ...
research
05/15/2018

An Exploration of Verbatim Content Republishing by News Producers

In today's news ecosystem, news sources emerge frequently and can vary w...
research
12/16/2022

Fine-grained Czech News Article Dataset: An Interdisciplinary Approach to Trustworthiness Analysis

We present the Verifee Dataset: a novel dataset of news articles with fi...
research
04/19/2021

"Don't quote me on that": Finding Mixtures of Sources in News Articles

Journalists publish statements provided by people, or sources to context...
research
07/09/2020

CompRes: A Dataset for Narrative Structure in News

This paper addresses the task of automatically detecting narrative struc...

Please sign up or login with your details

Forgot password? Click here to reset