Text2Time: Transformer-based Article Time Period Prediction

The task of predicting the publication period of text documents, such as news articles, is an important but less studied problem in the field of natural language processing. Predicting the year of a news article can be useful in various contexts, such as historical research, sentiment analysis, and media monitoring. In this work, we investigate the problem of predicting the publication period of a text document, specifically a news article, based on its textual content. In order to do so, we created our own extensive labeled dataset of over 350,000 news articles published by The New York Times over six decades. In our approach, we use a pretrained BERT model fine-tuned for the task of text classification, specifically for time period prediction.This model exceeds our expectations and provides some very impressive results in terms of accurately classifying news articles into their respective publication decades. The results beat the performance of the baseline model for this relatively unexplored task of time prediction from text.

READ FULL TEXT

page 4

page 6

page 7

page 8

research
12/22/2022

MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification

This article presents a dataset of 10,917 news articles with hierarchica...
research
12/14/2018

Measuring Similarity: Computationally Reproducing the Scholar's Interests

Computerized document classification already orders the news articles th...
research
07/05/2023

Emoji Prediction using Transformer Models

In recent years, the use of emojis in social media has increased dramati...
research
01/24/2022

Classification Of Fake News Headline Based On Neural Networks

Over the last few years, Text classification is one of the fundamental t...
research
05/13/2019

On the share of mathematics published by Elsevier and Springer

For-profit editors such as Elsevier and Springer have been subject to su...
research
03/23/2016

BreakingNews: Article Annotation by Image and Text Processing

Building upon recent Deep Neural Network architectures, current approach...
research
10/03/2012

Logical segmentation for article extraction in digitized old newspapers

Newspapers are documents made of news item and informative articles. The...

Please sign up or login with your details

Forgot password? Click here to reset