Exclusive Topic Modeling

02/06/2021
by   Hao Lei, et al.
0

We propose an Exclusive Topic Modeling (ETM) for unsupervised text classification, which is able to 1) identify the field-specific keywords though less frequently appeared and 2) deliver well-structured topics with exclusive words. In particular, a weighted Lasso penalty is imposed to reduce the dominance of the frequently appearing yet less relevant words automatically, and a pairwise Kullback-Leibler divergence penalty is used to implement topics separation. Simulation studies demonstrate that the ETM detects the field-specific keywords, while LDA fails. When applying to the benchmark NIPS dataset, the topic coherence score on average improves by 22 model with weighted Lasso penalty and pairwise Kullback-Leibler divergence penalty, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/22/2014

Parsimonious Topic Models with Salient Word Discovery

We propose a parsimonious topic model for text corpora. In related model...
research
10/07/2017

Topic Modeling based on Keywords and Context

Current topic models often suffer from discovering topics not matching h...
research
07/06/2023

S2vNTM: Semi-supervised vMF Neural Topic Modeling

Language model based methods are powerful techniques for text classifica...
research
03/09/2022

Enhance Topics Analysis based on Keywords Properties

Topic Modelling is one of the most prevalent text analysis technique use...
research
12/07/2015

Jointly Modeling Topics and Intents with Global Order Structure

Modeling document structure is of great importance for discourse analysi...
research
10/14/2020

On Cross-Dataset Generalization in Automatic Detection of Online Abuse

NLP research has attained high performances in abusive language detectio...
research
06/05/2019

Spatial automatic subgroup analysis for areal data with repeated measures

We consider the subgroup analysis problem for spatial areal data with re...

Please sign up or login with your details

Forgot password? Click here to reset