Diversity-Promoting Bayesian Learning of Latent Variable Models

11/23/2017
by   Pengtao Xie, et al.
0

To address three important issues involved in latent variable models (LVMs), including capturing infrequent patterns, achieving small-sized but expressive models and alleviating overfitting, several studies have been devoted to "diversifying" LVMs, which aim at encouraging the components in LVMs to be diverse. Most existing studies fall into a frequentist-style regularization framework, where the components are learned via point estimation. In this paper, we investigate how to "diversify" LVMs in the paradigm of Bayesian learning. We propose two approaches that have complementary advantages. One is to define a diversity-promoting mutual angular prior which assigns larger density to components with larger mutual angles and use this prior to affect the posterior via Bayes' rule. We develop two efficient approximate posterior inference algorithms based on variational inference and MCMC sampling. The other approach is to impose diversity-promoting regularization directly over the post-data distribution of components. We also extend our approach to "diversify" Bayesian nonparametric models where the number of components is infinite. A sampling algorithm based on slice sampling and Hamiltonian Monte Carlo is developed. We apply these methods to "diversify" Bayesian mixture of experts model and infinite latent feature model. Experiments on various datasets demonstrate the effectiveness and efficiency of our methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2016

Online but Accurate Inference for Latent Variable Models with Local Gibbs Sampling

We study parameter inference in large-scale latent variable models. We f...
research
05/20/2022

Sparse Infinite Random Feature Latent Variable Modeling

We propose a non-linear, Bayesian non-parametric latent variable model w...
research
12/23/2015

Latent Variable Modeling with Diversity-Inducing Mutual Angular Regularization

Latent Variable Models (LVMs) are a large family of machine learning mod...
research
05/19/2017

Accelerated Inference for Latent Variable Models

Inference of latent feature models in the Bayesian nonparametric setting...
research
12/11/2019

Bayesian Copula Density Deconvolution for Zero-Inflated Data in Nutritional Epidemiology

Estimating the marginal and joint densities of the long-term average int...
research
09/15/2022

Langevin Autoencoders for Learning Deep Latent Variable Models

Markov chain Monte Carlo (MCMC), such as Langevin dynamics, is valid for...
research
04/07/2020

Repulsive Mixture Models of Exponential Family PCA for Clustering

The mixture extension of exponential family principal component analysis...

Please sign up or login with your details

Forgot password? Click here to reset