Learning Deep Generative Models with Short Run Inference Dynamics

12/04/2019
by   Erik Nijkamp, et al.

This paper studies the fundamental problem of learning deep generative models that consist of one or more layers of latent variables organized in top-down architectures. Learning such a generative model requires inferring the latent variables for each training example based on the posterior distribution of these latent variables. The inference typically requires Markov chain Monte Carlo (MCMC), which can be time-consuming. In this paper, we propose to use short run inference dynamics guided by the log-posterior, such as a finite-step gradient descent algorithm initialized from the prior distribution of the latent variables, as an approximate sampler of the posterior distribution. The step size of the gradient descent dynamics is optimized by minimizing the Kullback-Leibler divergence between the distribution produced by the short run inference dynamics and the posterior distribution. Our experiments show that the proposed method outperforms the variational auto-encoder (VAE) in terms of reconstruction error and synthesis quality. The advantage of the proposed method is that it is natural and automatic, even for models with multiple layers of latent variables.
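
The short run inference idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a hypothetical linear generator x = W z + noise with a standard normal prior on z, uses a fixed step size, and omits the step that tunes the step size by minimizing the Kullback-Leibler divergence. The names `log_joint`, `grad_log_joint`, and `short_run_inference`, and all numeric values, are illustrative assumptions.

```python
import random


def reconstruct(z, W):
    """Top-down pass of the assumed linear generator: returns W z."""
    return [sum(w_ij * z_j for w_ij, z_j in zip(row, z)) for row in W]


def log_joint(z, x, W, sigma):
    """log p(x | z) + log p(z) up to a constant, i.e. the unnormalized log-posterior."""
    recon = reconstruct(z, W)
    sq_err = sum((x_i - r_i) ** 2 for x_i, r_i in zip(x, recon))
    return -sq_err / (2.0 * sigma ** 2) - sum(z_j ** 2 for z_j in z) / 2.0


def grad_log_joint(z, x, W, sigma):
    """Gradient of the unnormalized log-posterior with respect to z."""
    recon = reconstruct(z, W)
    resid = [x_i - r_i for x_i, r_i in zip(x, recon)]
    return [
        sum(W[i][j] * resid[i] for i in range(len(x))) / sigma ** 2 - z[j]
        for j in range(len(z))
    ]


def short_run_inference(x, W, sigma, num_steps, step_size, rng):
    """Run a fixed number of gradient steps, starting from a prior sample."""
    z_init = [rng.gauss(0.0, 1.0) for _ in range(len(W[0]))]  # z_0 ~ N(0, I)
    z = list(z_init)
    for _ in range(num_steps):
        grad = grad_log_joint(z, x, W, sigma)
        # Descent on the negative log-posterior = ascent on the log-posterior.
        z = [z_j + step_size * g_j for z_j, g_j in zip(z, grad)]
    return z_init, z


# Toy setup (assumed values): 3-dimensional observation, 2-dimensional latent.
rng = random.Random(0)
W = [[1.0, 0.5], [-0.5, 1.0], [0.3, 0.2]]
x = [1.0, -0.5, 0.2]
sigma = 1.0

z_init, z_final = short_run_inference(
    x, W, sigma, num_steps=20, step_size=0.1, rng=rng
)
print("log-posterior (unnormalized) at prior sample:", log_joint(z_init, x, W, sigma))
print("log-posterior (unnormalized) after short run:", log_joint(z_final, x, W, sigma))
```

In this toy setting the log-posterior is a concave quadratic, so a sufficiently small fixed step size increases it at every step. In the method described above the step size is instead a quantity to be optimized, and the generator is a deep top-down network rather than a linear map.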


