Expressive yet Tractable Bayesian Deep Learning via Subnetwork Inference

10/28/2020
by   Eric Nalisnick, et al.
0

The Bayesian paradigm has the potential to solve some of the core issues in modern deep learning, such as poor calibration, data inefficiency, and catastrophic forgetting. However, scaling Bayesian inference to the high-dimensional parameter spaces of deep neural networks requires restrictive approximations. In this paper, we propose performing inference over only a small subset of the model parameters while keeping all others as point estimates. This enables us to use expressive posterior approximations that would otherwise be intractable for the full model. In particular, we develop a practical and scalable Bayesian deep learning method that first trains a point estimate, and then infers a full covariance Gaussian posterior approximation over a subnetwork. We propose a subnetwork selection procedure which aims to optimally preserve posterior uncertainty. We empirically demonstrate the effectiveness of our approach compared to point-estimated networks and methods that use less expressive posterior approximations over the full network.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2020

How Good is the Bayes Posterior in Deep Neural Networks Really?

During the past five years the Bayesian deep learning community has deve...
research
01/27/2023

A Deep Learning Method for Comparing Bayesian Hierarchical Models

Bayesian model comparison (BMC) offers a principled approach for assessi...
research
06/13/2021

Post-hoc loss-calibration for Bayesian neural networks

Bayesian decision theory provides an elegant framework for acting optima...
research
05/20/2018

Online Structured Laplace Approximations For Overcoming Catastrophic Forgetting

We introduce the Kronecker factored online Laplace approximation for ove...
research
04/20/2020

Tractable Approximate Gaussian Inference for Bayesian Neural Networks

In this paper, we propose an analytical method allowing for tractable ap...
research
01/29/2020

The Case for Bayesian Deep Learning

The key distinguishing property of a Bayesian approach is marginalizatio...
research
10/30/2020

Bayesian Optimization Meets Laplace Approximation for Robotic Introspection

In robotics, deep learning (DL) methods are used more and more widely, b...

Please sign up or login with your details

Forgot password? Click here to reset