Rethinking Sharpness-Aware Minimization as Variational Inference

10/19/2022
by Szilvia Ujváry, et al.

Sharpness-aware minimization (SAM) aims to improve the generalisation of gradient-based learning by seeking out flat minima. In this work, we establish connections between SAM and Mean-Field Variational Inference (MFVI) of neural network parameters. We show that both methods can be interpreted as optimizing notions of flatness and that, when MFVI uses the reparametrisation trick, both reduce to calculating the gradient at a perturbed version of the current mean parameter. This observation motivates our study of algorithms that combine or interpolate between SAM and MFVI. We evaluate the proposed variational algorithms on several benchmark datasets and compare their performance to variants of SAM. Taking a broader perspective, our work suggests that SAM-like updates can be used as a drop-in replacement for the reparametrisation trick.
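To make the shared structure concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a toy quadratic loss, the usual first-order SAM perturbation (a step of length `rho` along the normalised gradient), and a Gaussian mean-field perturbation with fixed standard deviations; the KL term of the variational objective is left out. All function names and constants are hypothetical and chosen for illustration only.

```python
import math
import random

# Toy loss (an assumption for illustration): L(w) = 0.5 * sum(a_i * w_i**2)
CURVATURE = (1.0, 10.0)

def grad_loss(w):
    """Analytic gradient of the toy quadratic loss."""
    return [a * wi for a, wi in zip(CURVATURE, w)]

def sam_perturbation(w, rho=0.05):
    """Deterministic perturbation: step of length rho along the gradient."""
    g = grad_loss(w)
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm == 0.0:
        return [0.0] * len(w)
    return [rho * gi / norm for gi in g]

def mfvi_perturbation(sigma, rng):
    """Random perturbation from the reparametrisation trick: sigma * z, z ~ N(0, I)."""
    return [s * rng.gauss(0.0, 1.0) for s in sigma]

def perturbed_gradient(w, eps):
    """Both methods evaluate the loss gradient at w + eps."""
    return grad_loss([wi + ei for wi, ei in zip(w, eps)])

mu = [1.0, -0.5]          # current mean parameter
rng = random.Random(0)    # seeded for reproducibility

sam_grad = perturbed_gradient(mu, sam_perturbation(mu))
mfvi_grad = perturbed_gradient(mu, mfvi_perturbation([0.1, 0.1], rng))
print("SAM-style gradient: ", sam_grad)
print("MFVI-style gradient:", mfvi_grad)
```

In this sketch the two updates differ only in how the perturbation is chosen: SAM picks it deterministically from the local gradient, while the reparametrisation trick samples it from the variational distribution.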

research
01/03/2023

A Tutorial on Parametric Variational Inference

Variational inference uses optimization, rather than integration, to app...
research
10/20/2022

On Representations of Mean-Field Variational Inference

The mean field variational inference (MFVI) formulation restricts the ge...
research
04/16/2014

Structured Stochastic Variational Inference

Stochastic variational inference makes it possible to approximate poster...
research
09/27/2018

Variance reduction properties of the reparameterization trick

The reparameterization trick is widely used in variational inference as ...
research
10/14/2022

On the Relationship Between Variational Inference and Auto-Associative Memory

In this article, we propose a variational inference formulation of auto-...
research
10/04/2019

Streamlined Variational Inference for Linear Mixed Models with Crossed Random Effects

We derive streamlined mean field variational Bayes algorithms for fittin...
research
06/05/2021

Energy-Based Learning for Cooperative Games, with Applications to Feature/Data/Model Valuations

Valuation problems, such as attribution-based feature interpretation, da...
