An NLP approach to quantify dynamic salience of predefined topics in a text corpus

08/16/2021
by   A. Bock, et al.
0

The proliferation of news media available online simultaneously presents a valuable resource and significant challenge to analysts aiming to profile and understand social and cultural trends in a geographic location of interest. While an abundance of news reports documenting significant events, trends, and responses provides a more democratized picture of the social characteristics of a location, making sense of an entire corpus to extract significant trends is a steep challenge for any one analyst or team. Here, we present an approach using natural language processing techniques that seeks to quantify how a set of pre-defined topics of interest change over time across a large corpus of text. We found that, given a predefined topic, we can identify and rank sets of terms, or n-grams, that map to those topics and have usage patterns that deviate from a normal baseline. Emergence, disappearance, or significant variations in n-gram usage present a ground-up picture of a topic's dynamic salience within a corpus of interest.

READ FULL TEXT
research
06/22/2018

Using NLP on news headlines to predict index trends

This paper attempts to provide a state of the art in trend prediction us...
research
04/14/2020

Probabilistic Model of Narratives Over Topical Trends in Social Media: A Discrete Time Model

Online social media platforms are turning into the prime source of news ...
research
09/12/2019

Visualizing Trends of Key Roles in News Articles

There are tons of news articles generated every day reflecting the activ...
research
05/22/2023

A Diachronic Analysis of the NLP Research Paradigm Shift: When, How, and Why?

Understanding the fundamental concepts and trends in a scientific field ...
research
06/09/2022

Analyzing Folktales of Different Regions Using Topic Modeling and Clustering

This paper employs two major natural language processing techniques, top...
research
09/05/2020

Beyond Social Media Analytics: Understanding Human Behaviour and Deep Emotion using Self Structuring Incremental Machine Learning

This thesis develops a conceptual framework considering social data as r...
research
08/28/2019

Semantic Hypergraphs

Existing computational methods for the analysis of corpora of text in na...

Please sign up or login with your details

Forgot password? Click here to reset