Viewpoint and Topic Modeling of Current Events

08/14/2016
by   Kerry Zhang, et al.
0

There are multiple sides to every story, and while statistical topic models have been highly successful at topically summarizing the stories in corpora of text documents, they do not explicitly address the issue of learning the different sides, the viewpoints, expressed in the documents. In this paper, we show how these viewpoints can be learned completely unsupervised and represented in a human interpretable form. We use a novel approach of applying CorrLDA2 for this purpose, which learns topic-viewpoint relations that can be used to form groups of topics, where each group represents a viewpoint. A corpus of documents about the Israeli-Palestinian conflict is then used to demonstrate how a Palestinian and an Israeli viewpoint can be learned. By leveraging the magnitudes and signs of the feature weights of a linear SVM, we introduce a principled method to evaluate associations between topics and viewpoints. With this, we demonstrate, both quantitatively and qualitatively, that the learned topic groups are contextually coherent, and form consistently correct topic-viewpoint associations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2022

Human in the loop: How to effectively create coherent topics by manually labeling only a few documents per class

Few-shot methods for accurate modeling under sparse label-settings have ...
research
06/07/2023

Effective Neural Topic Modeling with Embedding Clustering Regularization

Topic models have been prevalent for decades with various applications. ...
research
08/08/2019

Assessing Sentiment of the Expressed Stance on Social Media

Stance detection is the task of inferring viewpoint towards a given topi...
research
03/31/2021

Topic Scaling: A Joint Document Scaling – Topic Model Approach To Learn Time-Specific Topics

This paper proposes a new methodology to study sequential corpora by imp...
research
09/06/2015

Sampled Weighted Min-Hashing for Large-Scale Topic Mining

We present Sampled Weighted Min-Hashing (SWMH), a randomized approach to...
research
07/03/2018

Topic Discovery in Massive Text Corpora Based on Min-Hashing

The task of discovering topics in text corpora has been dominated by Lat...
research
09/19/2021

Co-occurrence of medical conditions: Exposing patterns through probabilistic topic modeling of SNOMED codes

Patients associated with multiple co-occurring health conditions often f...

Please sign up or login with your details

Forgot password? Click here to reset