Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

02/03/2022
by   Liyi Zhang, et al.
2

Variational inference often minimizes the "reverse" Kullbeck-Leibler (KL) KL(q||p) from the approximate distribution q to the posterior p. Recent work studies the "forward" KL KL(p||q), which unlike reverse KL does not lead to variational approximations that underestimate uncertainty. This paper introduces Transport Score Climbing (TSC), a method that optimizes KL(p||q) by using Hamiltonian Monte Carlo (HMC) and a novel adaptive transport map. The transport map improves the trajectory of HMC by acting as a change of variable between the latent variable space and a warped space. TSC uses HMC samples to dynamically train the transport map while optimizing KL(p||q). TSC leverages synergies, where better transport maps lead to better HMC sampling, which then leads to better transport maps. We demonstrate TSC on synthetic and real data. We find that TSC achieves competitive performance when training variational autoencoders on large-scale data.

READ FULL TEXT

page 8

page 13

research
03/09/2019

NeuTra-lizing Bad Geometry in Hamiltonian Monte Carlo Using Neural Transport

Hamiltonian Monte Carlo is a powerful algorithm for sampling from diffic...
research
07/03/2023

Transport, Variational Inference and Diffusions: with Applications to Annealed Flows and Schrödinger Bridges

This paper explores the connections between optimal transport and variat...
research
03/23/2020

Markovian Score Climbing: Variational Inference with KL(p||q)

Modern variational inference (VI) uses stochastic gradients to avoid int...
research
02/04/2021

Variational Inference for Deblending Crowded Starfields

In the image data collected by astronomical surveys, stars and galaxies ...
research
01/17/2022

Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models

Diffusion probabilistic models (DPMs) represent a class of powerful gene...
research
07/17/2022

Gradients should stay on Path: Better Estimators of the Reverse- and Forward KL Divergence for Normalizing Flows

We propose an algorithm to estimate the path-gradient of both the revers...
research
08/09/2021

Pathfinder: Parallel quasi-Newton variational inference

We introduce Pathfinder, a variational method for approximately sampling...

Please sign up or login with your details

Forgot password? Click here to reset