Conic Scan-and-Cover algorithms for nonparametric topic modeling

10/09/2017
by   Mikhail Yurochkin, et al.
0

We propose new algorithms for topic modeling when the number of topics is unknown. Our approach relies on an analysis of the concentration of mass and angular geometry of the topic simplex, a convex polytope constructed by taking the convex hull of vertices representing the latent topics. Our algorithms are shown in practice to have accuracy comparable to a Gibbs sampler in terms of topic estimation, which requires the number of topics be given. Moreover, they are one of the fastest among several state of the art parametric techniques. Statistical consistency of our estimator is established under some conditions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2016

Geometric Dirichlet Means algorithm for topic inference

We propose a geometric algorithm for topic learning and inference that i...
research
08/22/2012

A non-parametric mixture model for topic modeling over time

A single, stationary topic model such as latent Dirichlet allocation is ...
research
10/07/2017

Topic Modeling based on Keywords and Context

Current topic models often suffer from discovering topics not matching h...
research
10/08/2021

Learning Topic Models: Identifiability and Finite-Sample Analysis

Topic models provide a useful text-mining tool for learning, extracting ...
research
09/24/2018

Streaming dynamic and distributed inference of latent geometric structures

We develop new models and algorithms for learning the temporal dynamics ...
research
05/17/2018

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

We propose a new method of estimation in topic models, that is not a var...
research
02/19/2016

Scaling up Dynamic Topic Models

Dynamic topic models (DTMs) are very effective in discovering topics and...

Please sign up or login with your details

Forgot password? Click here to reset