Noisy Natural Gradient as Variational Inference

12/06/2017
by   Guodong Zhang, et al.
0

Combining the flexibility of deep learning with Bayesian uncertainty estimation has long been a goal in our field, and many modern approaches are based on variational Bayes. Unfortunately, one is forced to choose between overly simplistic variational families (e.g. fully factorized) or expensive and complicated inference procedures. We show that natural gradient ascent with adaptive weight noise can be interpreted as fitting a variational posterior to maximize the evidence lower bound (ELBO). This insight allows us to train full covariance, fully factorized, and matrix variate Gaussian variational posteriors using noisy versions of natural gradient, Adam, and K-FAC, respectively. On standard regression benchmarks, our noisy K-FAC algorithm makes better predictions and matches HMC's predictive variances better than existing methods. Its improved uncertainty estimates lead to more efficient exploration in the settings of active learning and intrinsic motivation for reinforcement learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2018

Eigenvalue Corrected Noisy Natural Gradient

Variational Bayesian neural networks combine the flexibility of deep lea...
research
06/11/2021

Posterior Temperature Optimization in Variational Inference

Cold posteriors have been reported to perform better in practice in the ...
research
06/10/2021

Quantum Natural Gradient for Variational Bayes

Variational Bayes (VB) is a critical method in machine learning and stat...
research
06/13/2018

Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam

Uncertainty computation in deep learning is essential to design robust a...
research
11/15/2017

Variational Adaptive-Newton Method for Explorative Learning

We present the Variational Adaptive Newton (VAN) method which is a black...
research
10/26/2022

Exact Manifold Gaussian Variational Bayes

We propose an optimization algorithm for Variational Inference (VI) in c...
research
08/20/2019

Learning document embeddings along with their uncertainties

Majority of the text modelling techniques yield only point estimates of ...

Please sign up or login with your details

Forgot password? Click here to reset