L2M: Practical posterior Laplace approximation with optimization-driven second moment estimation

07/09/2021
by Christian S. Perone, et al.

Uncertainty quantification for deep neural networks has recently evolved through many techniques. In this work, we revisit the Laplace approximation, a classical and computationally attractive approach to posterior approximation. Instead of computing the curvature matrix, we show that, under some regularity conditions, the Laplace approximation can be constructed from the gradient second moment. This quantity is already estimated by many exponential moving average variants of Adagrad, such as Adam and RMSprop, but is traditionally discarded after training. Our method (L2M) requires no changes to the model or the optimization procedure, can be implemented in a few lines of code to yield reasonable results, needs no computation beyond what the optimizer already performs, and introduces no new hyperparameters. We hope our method can open new research directions on using quantities already computed by optimizers for uncertainty estimation in deep neural networks.
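
To make the idea concrete, here is a minimal sketch in plain Python of how an optimizer's second-moment estimate might be reused as a diagonal Laplace posterior. It is not the authors' implementation. The toy linear-regression problem, the function names (`adam_fit`, `laplace_from_second_moment`, `sample_params`), the scaling of the second moment by the dataset size, and the `prior_precision` term are all assumptions made for illustration; the abstract does not specify them.

```python
import math
import random


def adam_fit(xs, ys, steps=2000, lr=0.05, beta1=0.9, beta2=0.999, eps=1e-8, seed=0):
    """Fit y ~ w*x + b with Adam on single-example gradients.

    Returns the final parameters and the bias-corrected second-moment
    estimate that Adam maintains anyway (normally discarded after training).
    """
    rng = random.Random(seed)
    theta = [0.0, 0.0]  # [w, b]
    m = [0.0, 0.0]      # first-moment (mean of gradients) estimate
    v = [0.0, 0.0]      # second-moment (mean of squared gradients) estimate
    for t in range(1, steps + 1):
        i = rng.randrange(len(xs))
        err = theta[0] * xs[i] + theta[1] - ys[i]
        grad = [err * xs[i], err]  # gradient of 0.5 * err**2
        for k in range(2):
            m[k] = beta1 * m[k] + (1 - beta1) * grad[k]
            v[k] = beta2 * v[k] + (1 - beta2) * grad[k] ** 2
            m_hat = m[k] / (1 - beta1 ** t)
            v_hat = v[k] / (1 - beta2 ** t)
            theta[k] -= lr * m_hat / (math.sqrt(v_hat) + eps)
    v_final = [vk / (1 - beta2 ** steps) for vk in v]
    return theta, v_final


def laplace_from_second_moment(theta, v_hat, n_data, prior_precision=1.0):
    """Build a diagonal Gaussian posterior from the second-moment estimate.

    ASSUMPTION: the precision is taken as n_data * v_hat + prior_precision,
    i.e. the second moment stands in for a per-example diagonal curvature.
    This scaling is illustrative and not taken from the source.
    """
    variance = [1.0 / (n_data * vk + prior_precision) for vk in v_hat]
    return list(theta), variance


def sample_params(mean, variance, rng):
    """Draw one parameter vector from the diagonal Gaussian posterior."""
    return [rng.gauss(mu, math.sqrt(var)) for mu, var in zip(mean, variance)]


if __name__ == "__main__":
    data_rng = random.Random(1)
    xs = [data_rng.uniform(-1, 1) for _ in range(50)]
    ys = [2.0 * x - 0.5 + data_rng.gauss(0, 0.1) for x in xs]
    theta, v_hat = adam_fit(xs, ys)
    mean, var = laplace_from_second_moment(theta, v_hat, n_data=len(xs))
    print("posterior mean:", mean)
    print("posterior variance:", var)
    print("one sample:", sample_params(mean, var, random.Random(2)))
```

Predictive uncertainty could then be estimated by drawing several parameter samples and looking at the spread of the resulting predictions. In a deep learning framework the same quantity would be read from the optimizer state after training rather than tracked by hand.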

