How News Evolves? Modeling News Text and Coverage using Graphs and Hawkes Process

11/18/2021
by   Honggen Zhang, et al.
0

Monitoring news content automatically is an important problem. The news content, unlike traditional text, has a temporal component. However, few works have explored the combination of natural language processing and dynamic system models. One reason is that it is challenging to mathematically model the nuances of natural language. In this paper, we discuss how we built a novel dataset of news articles collected over time. Then, we present a method of converting news text collected over time to a sequence of directed multi-graphs, which represent semantic triples (Subject ! Predicate ! Object). We model the dynamics of specific topological changes from these graphs using discrete-time Hawkes processes. With our real-world data, we show that analyzing the structures of the graphs and the discrete-time Hawkes process model can yield insights on how the news events were covered and how to predict how it may be covered in the future.

READ FULL TEXT
research
12/22/2022

MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification

This article presents a dataset of 10,917 news articles with hierarchica...
research
10/25/2021

No News is Good News: A Critique of the One Billion Word Benchmark

The One Billion Word Benchmark is a dataset derived from the WMT 2011 Ne...
research
05/09/2021

News Kaleidoscope: Visual Investigation of Coverage Diversity in News Event Reporting

We develop a visual analytics system, NewsKaleidoscope, to investigate t...
research
06/14/2022

If it Bleeds, it Leads: A Computational Approach to Covering Crime in Los Angeles

Developing and improving computational approaches to covering news can i...
research
02/04/2014

Learning to Predict from Textual Data

Given a current news event, we tackle the problem of generating plausibl...
research
08/30/2023

Benchmarking Multilabel Topic Classification in the Kyrgyz Language

Kyrgyz is a very underrepresented language in terms of modern natural la...
research
08/28/2019

Semantic Hypergraphs

Existing computational methods for the analysis of corpora of text in na...

Please sign up or login with your details

Forgot password? Click here to reset