Low-dimensional Embeddings for Interpretable Anchor-based Topic Inference

11/18/2017
by   Moontae Lee, et al.
0

The anchor words algorithm performs provably efficient topic model inference by finding an approximate convex hull in a high-dimensional word co-occurrence space. However, the existing greedy algorithm often selects poor anchor words, reducing topic quality and interpretability. Rather than finding an approximate convex hull in a high-dimensional space, we propose to find an exact convex hull in a visualizable 2- or 3-dimensional space. Such low-dimensional embeddings both improve topics and clearly show users why the algorithm selects certain words.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2019

A New Anchor Word Selection Method for the Separable Topic Discovery

Separable Non-negative Matrix Factorization (SNMF) is an important metho...
research
06/18/2019

Interactive Topic Modeling with Anchor Words

The formalism of anchor words has enabled the development of fast topic ...
research
07/01/2017

Efficient Correlated Topic Modeling with Topic Embedding

Correlated topic modeling has been limited to small model and problem si...
research
11/15/2016

Anchor-Free Correlated Topic Modeling: Identifiability and Algorithm

In topic modeling, many algorithms that guarantee identifiability of the...
research
12/02/2017

Survival-Supervised Topic Modeling with Anchor Words: Characterizing Pancreatitis Outcomes

We introduce a new approach for topic modeling that is supervised by sur...
research
12/01/2017

Prediction-Constrained Topic Models for Antidepressant Recommendation

Supervisory signals can help topic models discover low-dimensional data ...
research
06/29/2014

Estimating the distribution of Galaxy Morphologies on a continuous space

The incredible variety of galaxy shapes cannot be summarized by human de...

Please sign up or login with your details

Forgot password? Click here to reset