PAC-Bayes training for neural networks: sparsity and uncertainty quantification

04/26/2022
by   Maximilian F. Steffen, et al.
0

We study the Gibbs posterior distribution from PAC-Bayes theory for sparse deep neural nets in a nonparametric regression setting. To access the posterior distribution, an efficient MCMC algorithm based on backpropagation is constructed. The training yields a Bayesian neural network with a joint distribution on the network parameters. Using a mixture over uniform priors on sparse sets of networks weights, we prove an oracle inequality which shows that the method adapts to the unknown regularity and hierarchical structure of the regression function. Studying the Gibbs posterior distribution from a frequentist Bayesian perspective, we analyze the diameter and show high coverage probability of the resulting credible sets. The method is illustrated in a simulation example.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2018

Empirical priors and coverage of posterior credible sets in a sparse normal mean model

Bayesian methods provide a natural means for uncertainty quantification,...
research
10/02/2019

Robust Bayesian Regression with Synthetic Posterior

Regression models are fundamental tools in statistics, but they typicall...
research
11/08/2018

Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods

We introduce a novel framework for the estimation of the posterior distr...
research
06/26/2019

Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Nets

We derive generalization and excess risk bounds for neural nets using a ...
research
06/02/2022

Masked Bayesian Neural Networks : Computation and Optimality

As data size and computing power increase, the architectures of deep neu...
research
06/08/2016

DISCO Nets: DISsimilarity COefficient Networks

We present a new type of probabilistic model which we call DISsimilarity...
research
12/09/2020

Uncertainty quantification for fault slip inversion

We propose an efficient Bayesian approach to infer a fault displacement ...

Please sign up or login with your details

Forgot password? Click here to reset