Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC

06/06/2017
by   Yulai Cong, et al.
1

It is challenging to develop stochastic gradient based scalable inference for deep discrete latent variable models (LVMs), due to the difficulties in not only computing the gradients, but also adapting the step sizes to different latent factors and hidden layers. For the Poisson gamma belief network (PGBN), a recently proposed deep discrete LVM, we derive an alternative representation that is referred to as deep latent Dirichlet allocation (DLDA). Exploiting data augmentation and marginalization techniques, we derive a block-diagonal Fisher information matrix and its inverse for the simplex-constrained global model parameters of DLDA. Exploiting that Fisher information matrix with stochastic gradient MCMC, we present topic-layer-adaptive stochastic gradient Riemannian (TLASGR) MCMC that jointly learns simplex-constrained global parameters across all layers and topics, with topic and layer specific learning rates. State-of-the-art results are demonstrated on big data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2018

WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling

To train an inference network jointly with a deep generative topic model...
research
06/30/2021

Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network

Hierarchical topic models such as the gamma belief network (GBN) have de...
research
06/22/2023

Efficient preconditioned stochastic gradient descent for estimation in latent variable models

Latent variable models are powerful tools for modeling complex phenomena...
research
02/05/2020

AdaGeo: Adaptive Geometric Learning for Optimization and Sampling

Gradient-based optimization and Markov Chain Monte Carlo sampling can be...
research
03/22/2019

Scalable Data Augmentation for Deep Learning

Scalable Data Augmentation (SDA) provides a framework for training deep ...
research
01/09/2019

Dirichlet Variational Autoencoder

This paper proposes Dirichlet Variational Autoencoder (DirVAE) using a D...
research
05/23/2023

Optimal Preconditioning and Fisher Adaptive Langevin Sampling

We define an optimal preconditioning for the Langevin diffusion by analy...

Please sign up or login with your details

Forgot password? Click here to reset