Layer Adaptive Node Selection in Bayesian Neural Networks: Statistical Guarantees and Implementation Details

08/25/2021
by   Sanket Jantre, et al.
0

Sparse deep neural networks have proven to be efficient for predictive model building in large-scale studies. Although several works have studied theoretical and numerical properties of sparse neural architectures, they have primarily focused on the edge selection. Sparsity through edge selection might be intuitively appealing; however, it does not necessarily reduce the structural complexity of a network. Instead pruning excessive nodes in each layer leads to a structurally sparse network which would have lower computational complexity and memory footprint. We propose a Bayesian sparse solution using spike-and-slab Gaussian priors to allow for node selection during training. The use of spike-and-slab prior alleviates the need of an ad-hoc thresholding rule for pruning redundant nodes from a network. In addition, we adopt a variational Bayes approach to circumvent the computational challenges of traditional Markov Chain Monte Carlo (MCMC) implementation. In the context of node selection, we establish the fundamental result of variational posterior consistency together with the characterization of prior parameters. In contrast to the previous works, our theoretical development relaxes the assumptions of the equal number of nodes and uniform bounds on all network weights, thereby accommodating sparse networks with layer-dependent node structures or coefficient bounds. With a layer-wise characterization of prior inclusion probabilities, we also discuss optimal contraction rates of the variational posterior. Finally, we provide empirical evidence to substantiate that our theoretical work facilitates layer-wise optimal node recovery together with competitive predictive performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2023

A comprehensive study of spike and slab shrinkage priors for structurally sparse Bayesian neural networks

Network complexity and computational efficiency have become increasingly...
research
11/15/2020

Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee

Sparse deep learning aims to address the challenge of huge storage consu...
research
10/22/2020

Spike and slab variational Bayes for high dimensional logistic regression

Variational Bayes (VB) is a popular scalable alternative to Markov chain...
research
06/29/2020

Statistical Foundation of Variational Bayes Neural Networks

Despite the popularism of Bayesian neural networks in recent years, its ...
research
05/29/2017

Model Selection in Bayesian Neural Networks via Horseshoe Priors

Bayesian Neural Networks (BNNs) have recently received increasing attent...
research
06/13/2018

Structured Variational Learning of Bayesian Neural Networks with Horseshoe Priors

Bayesian Neural Networks (BNNs) have recently received increasing attent...
research
07/08/2020

Double spike Dirichlet priors for structured weighting

Assigning weights to a large pool of objects is a fundamental task in a ...

Please sign up or login with your details

Forgot password? Click here to reset