Flat Seeking Bayesian Neural Networks

02/06/2023
by   Van-Anh Nguyen, et al.
0

Bayesian Neural Networks (BNNs) offer a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferencing a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with a lower sharpness have a better generalization ability. Nonetheless, existing posterior inferences are not aware of sharpness/flatness, hence possibly leading to high sharpness for the models sampled from it. In this paper, we develop theories, the Bayesian setting, and the variational inference approach for the sharpness-aware posterior. Specifically, the models sampled from our sharpness-aware posterior and the optimal approximate posterior estimating this sharpness-aware posterior have a better flatness, hence possibly possessing a higher generalization ability. We conduct experiments by leveraging the sharpness-aware posterior with the state-of-the-art Bayesian Neural Networks, showing that the flat-seeking counterparts outperform their baselines in all metrics of interest.

READ FULL TEXT
research
09/13/2019

Adversarial α-divergence Minimization for Bayesian Approximate Inference

Neural networks are popular models for regression. They are often traine...
research
12/23/2021

Latent Time Neural Ordinary Differential Equations

Neural ordinary differential equations (NODE) have been proposed as a co...
research
12/23/2021

Improving Robustness and Uncertainty Modelling in Neural Ordinary Differential Equations

Neural ordinary differential equations (NODE) have been proposed as a co...
research
02/21/2022

Non-Volatile Memory Accelerated Posterior Estimation

Bayesian inference allows machine learning models to express uncertainty...
research
06/09/2021

Loss function based second-order Jensen inequality and its application to particle variational inference

Bayesian model averaging, obtained as the expectation of a likelihood fu...
research
05/29/2022

Radial Spike and Slab Bayesian Neural Networks for Sparse Data in Ransomware Attacks

Ransomware attacks are increasing at an alarming rate, leading to large ...
research
07/26/2023

Identifiability and Falsifiability: Two Challenges for Bayesian Model Expansion

We study the identifiability of model parameters and falsifiability of m...

Please sign up or login with your details

Forgot password? Click here to reset