Bayesian Prior Networks with PAC Training

06/03/2019
by   Manuel Haussmann, et al.
0

We propose to train Bayesian Neural Networks (BNNs) by empirical Bayes as an alternative to posterior weight inference. By approximately marginalizing out an i.i.d. realization of a finite number of sibling weights per data-point using the Central Limit Theorem (CLT), we attain a scalable and effective Bayesian deep predictor. This approach directly models the posterior predictive distribution, by-passing the intractable posterior weight inference step. However, it introduces a prohibitively large number of hyperparameters for stable training. As the prior weights are marginalized and hyperparameters are optimized, the model also no longer provides a means to incorporate prior knowledge. We overcome both of these drawbacks by deriving a trivial PAC bound that comprises the marginal likelihood of the predictor and a complexity penalty. The outcome integrates organically into the prior networks framework, bringing about an effective and holistic Bayesian treatment of prediction uncertainty. We observe on various regression, classification, and out-of-domain detection benchmarks that our scalable method provides an improved model fit accompanied with significantly better uncertainty estimates than the state-of-the-art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2020

Learning Partially Known Stochastic Dynamics with Empirical PAC Bayes

We propose a novel scheme for fitting heavily parameterized non-linear s...
research
02/04/2022

Demystify Optimization and Generalization of Over-parameterized PAC-Bayesian Learning

PAC-Bayesian is an analysis framework where the training error can be ex...
research
03/18/2019

Combining Model and Parameter Uncertainty in Bayesian Neural Networks

Bayesian neural networks (BNNs) have recently regained a significant amo...
research
10/19/2020

PAC^m-Bayes: Narrowing the Empirical Risk Gap in the Misspecified Bayesian Regime

While the decision-theoretic optimality of the Bayesian formalism under ...
research
07/17/2018

On the Beta Prime Prior for Scale Parameters in High-Dimensional Bayesian Regression Models

We study high-dimensional Bayesian linear regression with a general beta...
research
05/16/2022

Appropriate reduction of the posterior distribution in fully Bayesian inversions

Bayesian inversion generates a posterior distribution of model parameter...
research
08/25/2023

Causally Sound Priors for Binary Experiments

We introduce the BREASE framework for the Bayesian analysis of randomize...

Please sign up or login with your details

Forgot password? Click here to reset