Topic Modeling based on Keywords and Context

10/07/2017
by Johannes Schneider, et al.

Current topic models often discover topics that do not match human intuition, switch topics unnaturally within documents, and impose high computational demands. We address these concerns by proposing a topic model and an inference algorithm based on automatically identifying characteristic keywords for topics. Keywords influence the topic assignments of nearby words. Our algorithm learns (key)word-topic scores and self-regulates the number of topics. Inference is simple and easily parallelizable. Qualitative analysis yields results comparable to state-of-the-art models (e.g., LDA), but with different strengths and weaknesses. Quantitative analysis using 9 datasets shows gains in classification accuracy, PMI score, computational performance, and consistency of topic assignments within documents, while most often using fewer topics.
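The idea that keywords influence the topic assignments of nearby words can be illustrated with a minimal sketch. This is not the authors' inference algorithm: the function name `assign_topics`, the fixed symmetric `window`, the additive scoring rule, and the hand-written `scores` table are all assumptions made for illustration. In the paper, the (key)word-topic scores are learned and the number of topics is self-regulated, neither of which is shown here.

```python
from collections import defaultdict


def assign_topics(tokens, scores, window=2):
    """Assign each token the topic with the highest summed score in its context.

    tokens: list of words in document order.
    scores: hypothetical mapping word -> {topic: score}; assumed given, not learned.
    window: number of neighbours on each side that contribute (an assumption).
    """
    assignments = []
    for i in range(len(tokens)):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        totals = defaultdict(float)
        # Every word in the window, including the token itself, votes
        # for topics in proportion to its word-topic score.
        for word in tokens[lo:hi]:
            for topic, score in scores.get(word, {}).items():
                totals[topic] += score
        # Tokens with no scored word nearby stay unassigned.
        assignments.append(max(totals, key=totals.get) if totals else None)
    return assignments


# Toy data: three keywords with made-up scores.
scores = {
    "goal": {"sports": 0.9},
    "match": {"sports": 0.6},
    "election": {"politics": 0.9},
    "vote": {"politics": 0.8},
}
tokens = ["the", "goal", "decided", "the", "match",
          "before", "the", "election", "vote"]
result = assign_topics(tokens, scores)
```

In this toy run, words without any score of their own (such as "the" or "decided") inherit the topic of the nearest keywords, so the assignment changes only once, near "before". That is the kind of within-document consistency the abstract refers to. Because each position depends only on its local window, the loop could also be split across documents or positions, which is one plausible reading of "easily parallelizable".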

research
03/09/2022

Enhance Topics Analysis based on Keywords Properties

Topic Modelling is one of the most prevalent text analysis technique use...
research
01/22/2020

Keyword-based Topic Modeling and Keyword Selection

Certain type of documents such as tweets are collected by specifying a s...
research
02/06/2021

Exclusive Topic Modeling

We propose an Exclusive Topic Modeling (ETM) for unsupervised text class...
research
04/13/2020

Keyword Assisted Topic Models

For a long time, many social scientists have conducted content analysis ...
research
10/09/2017

Conic Scan-and-Cover algorithms for nonparametric topic modeling

We propose new algorithms for topic modeling when the number of topics i...
research
12/16/2019

Optimized Tracking of Topic Evolution

Topic evolution modeling has been researched for a long time and has gai...
research
02/19/2016

Scaling up Dynamic Topic Models

Dynamic topic models (DTMs) are very effective in discovering topics and...
