Uncertainty-aware Topic Modeling Visualization

10/18/2021
by   Valerie Müller, et al.
0

Topic modeling is a state-of-the-art technique for analyzing text corpora. It uses a statistical model, most commonly Latent Dirichlet Allocation (LDA), to discover abstract topics that occur in the document collection. However, the LDA-based topic modeling procedure is based on a randomly selected initial configuration as well as a number of parameter values than need to be chosen. This induces uncertainties on the topic modeling results, and visualization methods should convey these uncertainties during the analysis process. We propose a visual uncertainty-aware topic modeling analysis. We capture the uncertainty by computing topic modeling ensembles and propose measures for estimating topic modeling uncertainty from the ensemble. Then, we propose to enhance state-of-the-art topic modeling visualization methods to convey the uncertainty in the topic modeling process. We visualize the entire ensemble of topic modeling results at different levels for topic and document analysis. We apply our visualization methods to a text corpus to document the impact of uncertainty on the analysis.

READ FULL TEXT

page 4

page 5

page 6

research
02/06/2021

Concentrated Document Topic Model

We propose a Concentrated Document Topic Model(CDTM) for unsupervised te...
research
10/16/2021

n-stage Latent Dirichlet Allocation: A Novel Approach for LDA

Nowadays, data analysis has become a problem as the amount of data is co...
research
05/23/2013

A Supervised Neural Autoregressive Topic Model for Simultaneous Image Classification and Annotation

Topic modeling based on latent Dirichlet allocation (LDA) has been a fra...
research
05/11/2020

SCAT: Second Chance Autoencoder for Textual Data

We present a k-competitive learning approach for textual autoencoders na...
research
02/23/2017

Stability of Topic Modeling via Matrix Factorization

Topic models can provide us with an insight into the underlying latent s...
research
05/24/2016

Computing Web-scale Topic Models using an Asynchronous Parameter Server

Topic models such as Latent Dirichlet Allocation (LDA) have been widely ...
research
10/29/2021

Word embeddings for topic modeling: an application to the estimation of the economic policy uncertainty index

Quantification of economic uncertainty is a key concept for the predicti...

Please sign up or login with your details

Forgot password? Click here to reset