Bayesian Layers: A Module for Neural Network Uncertainty

12/10/2018
by   Dustin Tran, et al.
14

We describe Bayesian Layers, a module designed for fast experimentation with neural network uncertainty. It extends neural network libraries with layers capturing uncertainty over weights (Bayesian neural nets), pre-activation units (dropout), activations ("stochastic output layers"), and the function itself (Gaussian processes). With reversible layers, one can also propagate uncertainty from input to output such as for flow-based distributions and constant-memory backpropagation. Bayesian Layers are a drop-in replacement for other layers, maintaining core features that one typically desires for experimentation. As demonstration, we fit a 10-billion parameter "Bayesian Transformer" on 512 TPUv2 cores, which replaces attention layers with their Bayesian counterpart.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2022

Variational Neural Networks

Bayesian Neural Networks (BNNs) provide a tool to estimate the uncertain...
research
05/17/2020

Global inducing point variational posteriors for Bayesian neural networks and deep Gaussian processes

Variational inference is a popular approach to reason about uncertainty ...
research
12/02/2019

Differential Bayesian Neural Nets

Neural Ordinary Differential Equations (N-ODEs) are a powerful building ...
research
02/18/2022

Out of Distribution Data Detection Using Dropout Bayesian Neural Networks

We explore the utility of information contained within a dropout based B...
research
11/29/2018

The Relevance of Bayesian Layer Positioning to Model Uncertainty in Deep Bayesian Active Learning

One of the main challenges of deep learning tools is their inability to ...
research
07/10/2020

Characteristics of Monte Carlo Dropout in Wide Neural Networks

Monte Carlo (MC) dropout is one of the state-of-the-art approaches for u...
research
08/29/2020

Loss convergence in a causal Bayesian neural network of retail firm performance

We extend the empirical results from the structural equation model (SEM)...

Please sign up or login with your details

Forgot password? Click here to reset