Exact marginal inference in Latent Dirichlet Allocation

03/31/2020
by   Hartmut Maennel, et al.
0

Assume we have potential "causes" z∈ Z, which produce "events" w with known probabilities β(w|z). We observe w_1,w_2,...,w_n, what can we say about the distribution of the causes? A Bayesian estimate will assume a prior on distributions on Z (we assume a Dirichlet prior) and calculate a posterior. An average over that posterior then gives a distribution on Z, which estimates how much each cause z contributed to our observations. This is the setting of Latent Dirichlet Allocation, which can be applied e.g. to topics "producing" words in a document. In this setting usually the number of observed words is large, but the number of potential topics is small. We are here interested in applications with many potential "causes" (e.g. locations on the globe), but only a few observations. We show that the exact Bayesian estimate can be computed in linear time (and constant space) in |Z| for a given upper bound on n with a surprisingly simple formula. We generalize this algorithm to the case of sparse probabilities β(w|z), in which we only need to assume that the tree width of an "interaction graph" on the observations is limited. On the other hand we also show that without such limitation the problem is NP-hard.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2012

A Spectral Algorithm for Latent Dirichlet Allocation

The problem of topic modeling can be seen as a generalization of the clu...
research
05/12/2016

An Empirical-Bayes Score for Discrete Bayesian Networks

Bayesian network structure learning is often performed in a Bayesian set...
research
06/27/2012

Sparse Stochastic Inference for Latent Dirichlet allocation

We present a hybrid algorithm for Bayesian topic models that combines th...
research
05/09/2012

Computing Posterior Probabilities of Structural Features in Bayesian Networks

We study the problem of learning Bayesian network structures from data. ...
research
07/05/2018

An MCMC Approach to Empirical Bayes Inference and Bayesian Sensitivity Analysis via Empirical Processes

Consider a Bayesian situation in which we observe Y ∼ p_θ, where θ∈Θ, an...
research
03/10/2016

Bayesian Opponent Exploitation in Imperfect-Information Games

Two fundamental problems in computational game theory are computing a Na...
research
05/16/2018

Deconvolution of dust mixtures by latent Dirichlet allocation in forensic science

Dust particles recovered from the soles of shoes may be indicative of th...

Please sign up or login with your details

Forgot password? Click here to reset