Bayesian neural networks via MCMC: a Python-based tutorial

04/02/2023
by Rohitash Chandra, et al.

Bayesian inference provides a methodology for parameter estimation and uncertainty quantification in machine learning and deep learning methods. Variational inference and Markov chain Monte Carlo (MCMC) sampling techniques are used to implement Bayesian inference. Over the past three decades, MCMC methods have faced a number of challenges in being adapted to larger models (such as those in deep learning) and to big data problems. Advanced proposals that incorporate gradients, such as a Langevin proposal distribution, provide a means to address some of the limitations of MCMC sampling for Bayesian neural networks. Furthermore, MCMC methods have typically been confined to use by statisticians and are still not prominent among deep learning researchers. We present a tutorial for MCMC methods that covers simple Bayesian linear and logistic models, and Bayesian neural networks. The aim of this tutorial is to bridge the gap between theory and implementation via coding, given the general scarcity of libraries and tutorials to this end. The tutorial provides Python code, together with data and instructions that enable its use and extension. We provide results for some benchmark problems showing the strengths and weaknesses of implementing the respective Bayesian models via MCMC. We highlight the challenges in sampling multi-modal posterior distributions, in particular for Bayesian neural networks, and the need for further improvement of convergence diagnosis.
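
To make the idea of a gradient-informed proposal concrete, the following is a minimal sketch of Metropolis-Hastings sampling with a Langevin proposal for a Bayesian linear model. It is not the tutorial's own code: the function names, the fixed noise variance `sigma2`, the prior variance `tau2`, the step size, and the synthetic data are all assumptions chosen for illustration.

```python
import numpy as np

# Assumed model: y ~ N(X w, sigma2 * I), prior w ~ N(0, tau2 * I).
def log_posterior(w, X, y, sigma2, tau2):
    resid = y - X @ w
    return -0.5 * (resid @ resid) / sigma2 - 0.5 * (w @ w) / tau2

def grad_log_posterior(w, X, y, sigma2, tau2):
    return X.T @ (y - X @ w) / sigma2 - w / tau2

def langevin_mh(X, y, n_samples=5000, step=1e-3, sigma2=0.25, tau2=10.0, seed=0):
    """Metropolis-Hastings with a Langevin (gradient-based) proposal."""
    rng = np.random.default_rng(seed)
    w = np.zeros(X.shape[1])
    samples = np.empty((n_samples, w.size))
    accepted = 0
    for i in range(n_samples):
        # Proposal: one gradient step plus Gaussian noise of variance `step`.
        mean_fwd = w + 0.5 * step * grad_log_posterior(w, X, y, sigma2, tau2)
        w_prop = mean_fwd + np.sqrt(step) * rng.standard_normal(w.size)
        # The proposal is asymmetric, so the reverse density is needed too.
        mean_rev = w_prop + 0.5 * step * grad_log_posterior(w_prop, X, y, sigma2, tau2)
        log_q_fwd = -np.sum((w_prop - mean_fwd) ** 2) / (2.0 * step)
        log_q_rev = -np.sum((w - mean_rev) ** 2) / (2.0 * step)
        log_alpha = (log_posterior(w_prop, X, y, sigma2, tau2)
                     - log_posterior(w, X, y, sigma2, tau2)
                     + log_q_rev - log_q_fwd)
        if np.log(rng.uniform()) < log_alpha:
            w = w_prop
            accepted += 1
        samples[i] = w
    return samples, accepted / n_samples

# Synthetic data (illustrative only): intercept 1.0, slope 2.0, noise sd 0.5.
rng = np.random.default_rng(42)
X = np.column_stack([np.ones(200), rng.standard_normal(200)])
y = X @ np.array([1.0, 2.0]) + 0.5 * rng.standard_normal(200)

samples, accept_rate = langevin_mh(X, y)
posterior_mean = samples[1000:].mean(axis=0)  # discard burn-in
print("posterior mean:", posterior_mean, "acceptance rate:", accept_rate)
```

Because this linear-Gaussian posterior is unimodal and has a closed form, the sampler's estimate can be checked against the analytic posterior mean. That check is not available for Bayesian neural networks, where the posterior is typically multi-modal.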

