Parallel Tempering With a Variational Reference

05/31/2022
by   Nikola Surjanovic, et al.
0

Sampling from complex target distributions is a challenging task fundamental to Bayesian inference. Parallel tempering (PT) addresses this problem by constructing a Markov chain on the expanded state space of a sequence of distributions interpolating between the posterior distribution and a fixed reference distribution, which is typically chosen to be the prior. However, in the typical case where the prior and posterior are nearly mutually singular, PT methods are computationally prohibitive. In this work we address this challenge by constructing a generalized annealing path connecting the posterior to an adaptively tuned variational reference. The reference distribution is tuned to minimize the forward (inclusive) KL divergence to the posterior distribution using a simple, gradient-free moment-matching procedure. We show that our adaptive procedure converges to the forward KL minimizer, and that the forward KL divergence serves as a good proxy to a previously developed measure of PT performance. We also show that in the large-data limit in typical Bayesian models, the proposed method improves in performance, while traditional PT deteriorates arbitrarily. Finally, we introduce PT with two references – one fixed, one variational – with a novel split annealing path that ensures stable variational reference adaptation. The paper concludes with experiments that demonstrate the large empirical gains achieved by our method in a wide range of realistic Bayesian inference scenarios.

READ FULL TEXT
research
06/30/2021

Variational Refinement for Importance Sampling Using the Forward Kullback-Leibler Divergence

Variational Inference (VI) is a popular alternative to asymptotically ex...
research
04/16/2021

On the Robustness to Misspecification of α-Posteriors and Their Variational Approximations

α-posteriors and their variational approximations distort standard poste...
research
10/12/2022

On Divergence Measures for Bayesian Pseudocoresets

A Bayesian pseudocoreset is a small synthetic dataset for which the post...
research
02/15/2021

Parallel Tempering on Optimized Paths

Parallel tempering (PT) is a class of Markov chain Monte Carlo algorithm...
research
07/15/2023

Minimal Random Code Learning with Mean-KL Parameterization

This paper studies the qualitative behavior and robustness of two varian...
research
06/23/2021

Sampling with Mirrored Stein Operators

We introduce a new family of particle evolution samplers suitable for co...
research
11/26/2009

A Bayesian Rule for Adaptive Control based on Causal Interventions

Explaining adaptive behavior is a central problem in artificial intellig...

Please sign up or login with your details

Forgot password? Click here to reset