Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry

04/06/2023
by   Jonas Gregor Wiese, et al.
0

Bayesian inference in deep neural networks is challenging due to the high-dimensional, strongly multi-modal parameter posterior density landscape. Markov chain Monte Carlo approaches asymptotically recover the true posterior but are considered prohibitively expensive for large modern architectures. Local methods, which have emerged as a popular alternative, focus on specific parameter regions that can be approximated by functions with tractable integrals. While these often yield satisfactory empirical results, they fail, by definition, to account for the multi-modality of the parameter posterior. In this work, we argue that the dilemma between exact-but-unaffordable and cheap-but-inexact approaches can be mitigated by exploiting symmetries in the posterior landscape. Such symmetries, induced by neuron interchangeability and certain activation functions, manifest in different parameter values leading to the same functional output value. We show theoretically that the posterior predictive density in Bayesian neural networks can be restricted to a symmetry-free parameter reference set. By further deriving an upper bound on the number of Monte Carlo chains required to capture the functional diversity, we propose a straightforward approach for feasible Bayesian inference. Our experiments suggest that efficient sampling is indeed possible, opening up a promising path to accurate uncertainty quantification in deep learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2018

Multi-core parallel tempering Bayeslands for basin and landscape evolution

In recent years, Bayesian inference has become a popular methodology for...
research
10/09/2011

Asymptotically Independent Markov Sampling: a new MCMC scheme for Bayesian Inference

In Bayesian statistics, many problems can be expressed as the evaluation...
research
03/19/2021

Semiparametric Bayesian Inference for Local Extrema of Functions in the Presence of Noise

There is a wide range of applications where the local extrema of a funct...
research
02/06/2020

How Good is the Bayes Posterior in Deep Neural Networks Really?

During the past five years the Bayesian deep learning community has deve...
research
07/24/2019

Transport Monte Carlo

In Bayesian inference, transport map is a promising alternative to the c...
research
05/24/2023

Masked Bayesian Neural Networks : Theoretical Guarantee and its Posterior Inference

Bayesian approaches for learning deep neural networks (BNN) have been re...
research
05/12/2022

Bayesian inference for stochastic oscillatory systems using the phase-corrected Linear Noise Approximation

Likelihood-based inference in stochastic non-linear dynamical systems, s...

Please sign up or login with your details

Forgot password? Click here to reset