Timeline: A Dynamic Hierarchical Dirichlet Process Model for Recovering Birth/Death and Evolution of Topics in Text Stream

03/15/2012
by   Amr Ahmed, et al.
0

Topic models have proven to be a useful tool for discovering latent structures in document collections. However, most document collections often come as temporal streams and thus several aspects of the latent structure such as the number of topics, the topics' distribution and popularity are time-evolving. Several models exist that model the evolution of some but not all of the above aspects. In this paper we introduce infinite dynamic topic models, iDTM, that can accommodate the evolution of all the aforementioned aspects. Our model assumes that documents are organized into epochs, where the documents within each epoch are exchangeable but the order between the documents is maintained across epochs. iDTM allows for unbounded number of topics: topics can die or be born at any epoch, and the representation of each topic can evolve according to a Markovian dynamics. We use iDTM to analyze the birth and evolution of topics in the NIPS community and evaluated the efficacy of our model on both simulated and real datasets with favorable outcome.

READ FULL TEXT
research
02/28/2013

Continuous-time Infinite Dynamic Topic Models

Topic models are probabilistic models for discovering topical themes in ...
research
05/06/2018

Dynamic and Static Topic Model for Analyzing Time-Series Document Collections

For extracting meaningful topics from texts, their structures should be ...
research
02/03/2023

ANTM: An Aligned Neural Topic Model for Exploring Evolving Topics

As the amount of text data generated by humans and machines increases, t...
research
11/15/2017

Deep Temporal-Recurrent-Replicated-Softmax for Topical Trends over Time

Dynamic topic modeling facilitates the identification of topical trends ...
research
06/23/2021

Recurrent Coupled Topic Modeling over Sequential Documents

The abundant sequential documents such as online archival, social media ...
research
09/26/2014

Topic Similarity Networks: Visual Analytics for Large Document Sets

We investigate ways in which to improve the interpretability of LDA topi...
research
10/08/2021

Learning Topic Models: Identifiability and Finite-Sample Analysis

Topic models provide a useful text-mining tool for learning, extracting ...

Please sign up or login with your details

Forgot password? Click here to reset