Challenges in Bayesian inference via Markov chain Monte Carlo for neural networks

10/15/2019
by   Theodore Papamarkou, et al.
0

Markov chain Monte Carlo (MCMC) methods and neural networks are instrumental in tackling inferential and prediction problems. However, Bayesian inference based on joint use of MCMC methods and of neural networks is limited. This paper reviews the main challenges posed by neural networks to MCMC developments, including lack of parameter identifiability due to weight symmetries, prior specification effects, and consequently high computational cost and convergence failure. Population and manifold MCMC algorithms are combined to demonstrate these challenges via multilayer perceptron (MLP) examples and to develop case studies for assessing the capacity of approximate inference methods to uncover the posterior covariance of neural network parameters. Some of these challenges, such as high computational cost arising from the application of neural networks to big data and parameter identifiability arising from weight symmetries, stimulate research towards more scalable approximate MCMC methods or towards MCMC methods in reduced parameter spaces.

READ FULL TEXT
research
07/16/2019

Stochastic gradient Markov chain Monte Carlo

Markov chain Monte Carlo (MCMC) algorithms are generally regarded as the...
research
10/06/2022

Approximate Methods for Bayesian Computation

Rich data generating mechanisms are ubiquitous in this age of informatio...
research
03/01/2020

Markov Chain Monte Carlo with Neural Network Surrogates: Application to Contaminant Source Identification

Subsurface remediation often involves reconstruction of contaminant rele...
research
06/11/2021

DG-LMC: A Turn-key and Scalable Synchronous Distributed MCMC Algorithm via Langevin Monte Carlo within Gibbs

Performing reliable Bayesian inference on a big data scale is becoming a...
research
10/26/2021

Highly Scalable Maximum Likelihood and Conjugate Bayesian Inference for ERGMs on Graph Sets with Equivalent Vertices

The exponential family random graph modeling (ERGM) framework provides a...
research
06/09/2022

Damage Identification in Fiber Metal Laminates using Bayesian Analysis with Model Order Reduction

Fiber metal laminates (FML) are composite structures consisting of metal...
research
06/18/2015

Hamiltonian Monte Carlo Acceleration Using Surrogate Functions with Random Bases

For big data analysis, high computational cost for Bayesian methods ofte...

Please sign up or login with your details

Forgot password? Click here to reset