Structured Stochastic Gradient MCMC

07/19/2021
by Antonios Alexos, et al.

Stochastic gradient Markov chain Monte Carlo (SGMCMC) is considered the gold standard for Bayesian inference in large-scale models, such as Bayesian neural networks. Since practitioners face speed versus accuracy tradeoffs in these models, variational inference (VI) is often the preferable option. Unfortunately, VI makes strong assumptions about both the factorization and the functional form of the posterior. In this work, we propose a new non-parametric variational approximation that makes no assumptions about the approximate posterior's functional form and allows practitioners to specify the exact dependencies the algorithm should respect or break. The approach relies on a new Langevin-type algorithm that operates on a modified energy function, in which parts of the latent variables are averaged over samples from earlier iterations of the Markov chain. In this way, statistical dependencies can be broken in a controlled manner, allowing the chain to mix faster. The scheme can be further modified in a "dropout" manner, which improves scalability further. By implementing the scheme on a ResNet-20 architecture, we obtain better predictive likelihoods and larger effective sample sizes than full SGMCMC.
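The idea of running Langevin dynamics on a modified energy, where some variables are averaged over earlier samples, can be illustrated on a toy problem. The following is a minimal sketch under stated assumptions, not the paper's algorithm: the target is a two-dimensional correlated Gaussian, the updates use exact gradients rather than stochastic minibatch gradients, and the averaging uses a fixed-size buffer of past samples. The names `grad_energy` and `structured_langevin`, the buffer size, and the step size are all illustrative choices that do not come from the source.

```python
import math
import random
from collections import deque


def grad_energy(x, y, rho):
    """Gradient of U(x, y) = (x^2 + y^2 - 2*rho*x*y) / (2 * (1 - rho^2))."""
    scale = 1.0 / (1.0 - rho ** 2)
    return scale * (x - rho * y), scale * (y - rho * x)


def structured_langevin(n_steps=2000, step=0.01, rho=0.9, buffer_size=50, seed=0):
    """Toy Langevin sampler on a modified energy (illustrative assumption).

    When x is updated, the gradient is averaged over stored past values of y,
    and vice versa, so each coordinate only sees the other through an average
    over earlier iterations of the chain.
    """
    rng = random.Random(seed)
    x, y = 1.0, -1.0
    # Fixed-size buffers of earlier samples (hypothetical design choice).
    past_x = deque([x], maxlen=buffer_size)
    past_y = deque([y], maxlen=buffer_size)
    noise_scale = math.sqrt(2.0 * step)
    samples = []
    for _ in range(n_steps):
        # Gradient w.r.t. x, averaged over earlier samples of y.
        gx = sum(grad_energy(x, y_old, rho)[0] for y_old in past_y) / len(past_y)
        # Gradient w.r.t. y, averaged over earlier samples of x.
        gy = sum(grad_energy(x_old, y, rho)[1] for x_old in past_x) / len(past_x)
        # Standard Langevin update: gradient step plus Gaussian noise.
        x = x - step * gx + noise_scale * rng.gauss(0.0, 1.0)
        y = y - step * gy + noise_scale * rng.gauss(0.0, 1.0)
        past_x.append(x)
        past_y.append(y)
        samples.append((x, y))
    return samples


if __name__ == "__main__":
    draws = structured_langevin()
    mean_x = sum(s[0] for s in draws) / len(draws)
    print(f"{len(draws)} samples, mean of x = {mean_x:.3f}")
```

In this sketch the direct coupling between the two coordinates is replaced by a coupling to an average over past samples, which is one simple way to picture how a dependency could be broken in a controlled manner. Which dependencies to break, and how the averaging is actually carried out in a neural network, are specified in the full paper rather than here.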

