Gradient Regularisation as Approximate Variational Inference

11/20/2020
by Ali Unlu, et al.

Variational inference in Bayesian neural networks is usually performed using stochastic sampling, which gives very high-variance gradients and hence slow learning. Here, we show that it is possible to obtain a deterministic approximation of the ELBO for a Bayesian neural network by doing a Taylor-series expansion around the mean of the current variational distribution. The resulting approximate ELBO is the training log-likelihood plus a squared-gradient regulariser. In addition to learning the approximate posterior variance, we also consider a uniform-variance approximate posterior, inspired by the stationary distribution of SGD. The corresponding approximate ELBO has a simple form: the log-likelihood plus a simple squared-gradient regulariser. We argue that this squared-gradient regularisation may be at the root of the excellent empirical performance of SGD.
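
To make the shape of such an objective concrete, the following is a minimal sketch, not the paper's implementation. It assumes a toy logistic-regression likelihood, a uniform-variance Gaussian approximate posterior with variance sigma2 centred on the weights w, and an outer-product (Fisher-style) approximation of the Hessian built from per-example gradients. None of these details is stated in the abstract. Under those assumptions, the second-order Taylor term reduces to a penalty of (sigma2 / 2) times the sum of squared per-example gradient norms. The KL term of the ELBO is left out, and the function name approx_elbo_terms and the data are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def approx_elbo_terms(w, xs, ys, sigma2):
    """Return (log_lik, penalty) evaluated at the variational mean w.

    Assumptions (illustrative only): logistic-regression likelihood,
    posterior covariance sigma2 * I, and the Hessian replaced by the
    negative sum of per-example gradient outer products.
    """
    log_lik = 0.0
    sq_grad = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
        log_lik += y * math.log(p) + (1 - y) * math.log(1.0 - p)
        # Per-example gradient of the log-likelihood w.r.t. w is (y - p) * x,
        # so its squared norm is (y - p)^2 * ||x||^2.
        sq_grad += (y - p) ** 2 * sum(xi * xi for xi in x)
    penalty = 0.5 * sigma2 * sq_grad
    return log_lik, penalty

# Hypothetical toy data: first feature is a constant bias input.
xs = [[1.0, 0.5], [1.0, -1.5], [1.0, 2.0]]
ys = [1, 0, 1]

log_lik, penalty = approx_elbo_terms([0.0, 0.0], xs, ys, sigma2=0.1)
objective = log_lik - penalty  # deterministic; no weight sampling needed
print(log_lik, penalty, objective)
```

Because the objective is evaluated only at the mean weights, it involves no Monte Carlo sampling, which is the source of the variance reduction the abstract describes. The penalty is non-negative and grows with sigma2, so larger assumed posterior variance favours regions where per-example gradients are small.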


research
02/27/2021

Variational Laplace for Bayesian neural networks

We develop variational Laplace for Bayesian neural networks (BNNs) which...
research
06/09/2019

Note on the bias and variance of variational inference

In this note, we study the relationship between the variational gap and ...
research
11/11/2018

SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient

Uncertainty estimation in large deep-learning models is a computationall...
research
06/28/2016

Automatic Variational ABC

Approximate Bayesian Computation (ABC) is a framework for performing lik...
research
12/18/2022

Faithful Heteroscedastic Regression with Neural Networks

Heteroscedastic regression models a Gaussian variable's mean and varianc...
research
04/06/2015

Early Stopping is Nonparametric Variational Inference

We show that unconverged stochastic gradient descent can be interpreted ...
research
11/13/2019

Error bounds for some approximate posterior measures in Bayesian inference

In certain applications involving the solution of a Bayesian inverse pro...
