A non-parametric mixture model for topic modeling over time

08/22/2012
by   Avinava Dubey, et al.
0

A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling corpora that span long time periods, as the popularity of topics is likely to change over time. A number of models that incorporate time have been proposed, but in general they either exhibit limited forms of temporal variation, or require computationally expensive inference methods. In this paper we propose non-parametric Topics over Time (npTOT), a model for time-varying topics that allows an unbounded number of topics and exible distribution over the temporal variations in those topics' popularity. We develop a collapsed Gibbs sampler for the proposed model and compare against existing models on synthetic and real document sets.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

04/23/2020

A Gamma-Poisson Mixture Topic Model for Short Text

Most topic models are constructed under the assumption that documents fo...
03/15/2012

Timeline: A Dynamic Hierarchical Dirichlet Process Model for Recovering Birth/Death and Evolution of Topics in Text Stream

Topic models have proven to be a useful tool for discovering latent stru...
10/09/2017

Conic Scan-and-Cover algorithms for nonparametric topic modeling

We propose new algorithms for topic modeling when the number of topics i...
11/27/2012

A simple non-parametric Topic Mixture for Authors and Documents

This article reviews the Author-Topic Model and presents a new non-param...
02/28/2013

Continuous-time Infinite Dynamic Topic Models

Topic models are probabilistic models for discovering topical themes in ...
06/28/2020

Mapping Topic Evolution Across Poetic Traditions

Poetic traditions across languages evolved differently, but we find that...
11/18/2017

The Cultural Evolution of National Constitutions

We explore how ideas from infectious disease and genetics can be used to...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.