Spatial-Temporal-Fusion BNN: Variational Bayesian Feature Layer

12/12/2021
by   Shiye Lei, et al.
0

Bayesian neural networks (BNNs) have become a principal approach to alleviate overconfident predictions in deep learning, but they often suffer from scaling issues due to a large number of distribution parameters. In this paper, we discover that the first layer of a deep network possesses multiple disparate optima when solely retrained. This indicates a large posterior variance when the first layer is altered by a Bayesian layer, which motivates us to design a spatial-temporal-fusion BNN (STF-BNN) for efficiently scaling BNNs to large models: (1) first normally train a neural network from scratch to realize fast training; and (2) the first layer is converted to Bayesian and inferred by employing stochastic variational inference, while other layers are fixed. Compared to vanilla BNNs, our approach can greatly reduce the training time and the number of parameters, which contributes to scale BNNs efficiently. We further provide theoretical guarantees on the generalizability and the capability of mitigating overconfidence of STF-BNN. Comprehensive experiments demonstrate that STF-BNN (1) achieves the state-of-the-art performance on prediction and uncertainty quantification; (2) significantly improves adversarial robustness and privacy preservation; and (3) considerably reduces training time and memory costs.

READ FULL TEXT
research
10/26/2020

Scalable Bayesian neural networks by layer-wise input augmentation

We introduce implicit Bayesian neural networks, a simple and scalable ap...
research
02/26/2019

Variational Inference to Measure Model Uncertainty in Deep Neural Networks

We present a novel approach for training deep neural networks in a Bayes...
research
11/03/2020

Amortized Variational Deep Q Network

Efficient exploration is one of the most important issues in deep reinfo...
research
11/15/2020

Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee

Sparse deep learning aims to address the challenge of huge storage consu...
research
12/30/2021

A Lightweight and Accurate Spatial-Temporal Transformer for Traffic Forecasting

We study the forecasting problem for traffic with dynamic, possibly peri...
research
12/04/2020

Encoding the latent posterior of Bayesian Neural Networks for uncertainty quantification

Bayesian neural networks (BNNs) have been long considered an ideal, yet ...
research
05/01/2023

Bayesian system identification for structures considering spatial and temporal correlation

The decreasing cost and improved sensor and monitoring system technology...

Please sign up or login with your details

Forgot password? Click here to reset