An Instability in Variational Inference for Topic Models

02/02/2018
by   Behrooz Ghorbani, et al.

Topic models are Bayesian models that are frequently used to capture the latent structure of certain corpora of documents or images. Each data element in such a corpus (for instance, each item in a collection of scientific articles) is regarded as a convex combination of a small number of vectors corresponding to 'topics' or 'components'. The weights are assumed to have a Dirichlet prior distribution. The standard approach to approximating the posterior is to use variational inference algorithms, and in particular a mean field approximation. We show that this approach suffers from an instability that can produce misleading conclusions. Namely, for certain regimes of the model parameters, variational inference outputs a non-trivial decomposition into topics. However, for the same parameter values, the data contain no actual information about the true decomposition, and hence the output of the algorithm is uncorrelated with the true topic decomposition. Among other consequences, the estimated posterior mean is significantly wrong, and estimated Bayesian credible regions do not achieve the nominal coverage. We discuss how this instability is remedied by more accurate mean field approximations.
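To make the model structure described above concrete, the following is a minimal sketch of sampling a synthetic corpus in which each data element is a convex combination of a few topic vectors with Dirichlet-distributed weights. The abstract does not specify the noise model, the topic distribution, or any parameter values, so the Gaussian topics, the additive Gaussian noise, the function name `sample_corpus`, and all default values here are assumptions chosen purely for illustration.

```python
import numpy as np


def sample_corpus(n_docs=200, dim=50, n_topics=3, alpha=1.0, noise_std=0.1, seed=0):
    """Sample data whose rows are noisy convex combinations of topic vectors.

    All distributional choices besides the Dirichlet prior on the weights
    are illustrative assumptions, not taken from the abstract.
    """
    rng = np.random.default_rng(seed)
    # Topic vectors: assumed standard Gaussian for illustration.
    topics = rng.normal(size=(n_topics, dim))
    # Weights: one Dirichlet draw per data element; each row is
    # non-negative and sums to 1, so it defines a convex combination.
    weights = rng.dirichlet(alpha * np.ones(n_topics), size=n_docs)
    # Observations: convex combination of topics plus assumed Gaussian noise.
    data = weights @ topics + noise_std * rng.normal(size=(n_docs, dim))
    return data, weights, topics


data, weights, topics = sample_corpus()
```

Comparing the weights recovered by an inference algorithm against the ground-truth `weights` from such a simulation is one way to check whether the output is correlated with the true decomposition.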


