A statistical theory of cold posteriors in deep neural networks

08/13/2020
by   Laurence Aitchison, et al.
0

To get Bayesian neural networks to perform comparably to standard neural networks it is usually necessary to artificially reduce uncertainty using a "tempered" or "cold" posterior. This is extremely concerning: if the prior is accurate, Bayes inference/decision theory is optimal, and any artificial changes to the posterior should harm performance. While this suggests that the prior may be at fault, here we argue that in fact, BNNs for image classification use the wrong likelihood. In particular, standard image benchmark datasets such as CIFAR-10 are carefully curated. We develop a generative model describing curation which gives a principled Bayesian account of cold posteriors, because the likelihood under this new generative model closely matches the tempered likelihoods used in past work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2021

A statistical theory of out-of-distribution detection

We introduce a principled approach to detecting out-of-distribution (OOD...
research
08/13/2020

A statistical theory of semi-supervised learning

We currently lack a solid statistical understanding of semi-supervised l...
research
02/06/2020

How Good is the Bayes Posterior in Deep Neural Networks Really?

During the past five years the Bayesian deep learning community has deve...
research
02/27/2022

Towards Unifying Logical Entailment and Statistical Estimation

This paper gives a generative model of the interpretation of formal logi...
research
09/12/2019

Learning Bayes' theorem with a neural network for gravitational-wave inference

We wish to achieve the Holy Grail of Bayesian inference with deep-learni...
research
05/27/2022

How Tempering Fixes Data Augmentation in Bayesian Neural Networks

While Bayesian neural networks (BNNs) provide a sound and principled alt...
research
12/12/2019

Diagnosing model misspecification and performing generalized Bayes' updates via probabilistic classifiers

Model misspecification is a long-standing enigma of the Bayesian inferen...

Please sign up or login with your details

Forgot password? Click here to reset