The Dynamic Embedded Topic Model

07/12/2019
by   Adji B. Dieng, et al.
0

Topic modeling analyzes documents to learn meaningful patterns of words. Dynamic topic models capture how these patterns vary over time for a set of documents that were collected over a large time span. We develop the dynamic embedded topic model (D-ETM), a generative model of documents that combines dynamic latent Dirichlet allocation (D-LDA) and word embeddings. The D-ETM models each word with a categorical distribution whose parameter is given by the inner product between the word embedding and an embedding representation of its assigned topic at a particular time step. The word embeddings allow the D-ETM to generalize to rare words. The D-ETM learns smooth topic trajectories by defining a random walk prior over the embeddings of the topics. We fit the D-ETM using structured amortized variational inference. On a collection of United Nations debates, we find that the D-ETM learns interpretable topics and outperforms D-LDA in terms of both topic quality and predictive performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2019

Topic Modeling in Embedding Spaces

Topic modeling analyzes documents to learn meaningful patterns of words....
research
05/01/2019

Nested Variational Autoencoder for Topic Modeling on Microtexts with Word Vectors

Most of the information on the Internet is represented in the form of mi...
research
10/25/2016

Scalable Dynamic Topic Modeling with Clustered Latent Dirichlet Allocation (CLDA)

Topic modeling, a method for extracting the underlying themes from a col...
research
01/26/2023

Neural Dynamic Focused Topic Model

Topic models and all their variants analyse text by learning meaningful ...
research
04/01/2016

Nonparametric Spherical Topic Modeling with Word Embeddings

Traditional topic models do not account for semantic regularities in lan...
research
06/13/2012

Continuous Time Dynamic Topic Models

In this paper, we develop the continuous time dynamic topic model (cDTM)...
research
11/30/2016

Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge

While generative models such as Latent Dirichlet Allocation (LDA) have p...

Please sign up or login with your details

Forgot password? Click here to reset