MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification

12/22/2022
by   Alina Petukhova, et al.
0

This article presents a dataset of 10,917 news articles with hierarchical news categories collected between January 1st 2019, and December 31st 2019. We manually labelled the articles based on a hierarchical taxonomy with 17 first-level and 109 second-level categories. This dataset can be used to train machine learning models for automatically classifying news articles by topic. This dataset can be helpful for researchers working on news structuring, classification, and predicting future events based on released news.

READ FULL TEXT
research
03/10/2022

NELA-GT-2021: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

In this paper, we present the fourth installment of the NELA-GT datasets...
research
08/27/2018

Models for Predicting Community-Specific Interest in News Articles

In this work, we ask two questions: 1. Can we predict the type of commun...
research
04/21/2023

Text2Time: Transformer-based Article Time Period Prediction

The task of predicting the publication period of text documents, such as...
research
02/21/2018

Investigating Rumor News Using Agreement-Aware Search

In recent years, rumor news has been generated by humans as well as robo...
research
02/04/2014

Learning to Predict from Textual Data

Given a current news event, we tackle the problem of generating plausibl...
research
11/18/2021

How News Evolves? Modeling News Text and Coverage using Graphs and Hawkes Process

Monitoring news content automatically is an important problem. The news ...
research
11/21/2019

An Empirical Study of Sections in Classifying Disease Outbreak Reports

Identifying articles that relate to infectious diseases is a necessary s...

Please sign up or login with your details

Forgot password? Click here to reset