Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling

02/25/2021
by Gregory W. Benton, et al.

With a better understanding of the loss surfaces for multilayer networks, we can build more robust and accurate training procedures. Recently it was discovered that independently trained SGD solutions can be connected along one-dimensional paths of near-constant training loss. In this paper, we show that there are mode-connecting simplicial complexes that form multi-dimensional manifolds of low loss, connecting many independently trained models. Inspired by this discovery, we show how to efficiently build simplicial complexes for fast ensembling, outperforming independently trained deep ensembles in accuracy, calibration, and robustness to dataset shift. Notably, our approach only requires a few training epochs to discover a low-loss simplex, starting from a pre-trained solution. Code is available at https://github.com/g-benton/loss-surface-simplexes.
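To make the ensembling idea concrete, the following is a minimal sketch, not the authors' implementation (see the linked repository for that). It assumes the vertices of a low-loss simplex are already available as flat parameter vectors, and that ensemble members are drawn uniformly from the simplex as convex combinations of those vertices; the uniform sampling scheme is an assumption here. The names `sample_simplex_weights`, `combine_vertices`, `simplex_ensemble_predict`, and `toy_predict` are hypothetical and chosen only for illustration.

```python
import math
import random


def sample_simplex_weights(num_vertices, rng):
    """Draw barycentric weights uniformly from the standard simplex.

    Normalizing i.i.d. Exponential(1) draws gives a Dirichlet(1, ..., 1) sample.
    """
    draws = [rng.expovariate(1.0) for _ in range(num_vertices)]
    total = sum(draws)
    return [d / total for d in draws]


def combine_vertices(vertices, weights):
    """Return the convex combination of parameter vectors (flat lists of floats)."""
    dim = len(vertices[0])
    return [sum(w * v[i] for w, v in zip(weights, vertices)) for i in range(dim)]


def simplex_ensemble_predict(vertices, predict_fn, x, num_samples, seed=0):
    """Average the predictions of models sampled from the simplex spanned by `vertices`."""
    rng = random.Random(seed)
    total = None
    for _ in range(num_samples):
        weights = sample_simplex_weights(len(vertices), rng)
        params = combine_vertices(vertices, weights)
        probs = predict_fn(params, x)
        total = probs if total is None else [t + p for t, p in zip(total, probs)]
    return [t / num_samples for t in total]


def toy_predict(params, x):
    """Hypothetical stand-in for a network: two-class logistic model on a scalar input."""
    w, b = params
    p = 1.0 / (1.0 + math.exp(-(w * x + b)))
    return [p, 1.0 - p]


# Three hypothetical simplex vertices, each a (weight, bias) pair.
vertices = [[1.0, 0.0], [1.2, -0.1], [0.9, 0.2]]
probs = simplex_ensemble_predict(vertices, toy_predict, x=0.5, num_samples=20)
```

In a real setting, `predict_fn` would load the sampled parameters into a network and return class probabilities; the averaging step is unchanged.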


research
02/27/2018

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

The loss functions of deep neural networks are complex and their geometr...
research
06/18/2018

Using Mode Connectivity for Loss Landscape Analysis

Mode connectivity is a recently introduced framework that empirically ...
research
03/02/2023

Multi-Head Multi-Loss Model Calibration

Delivering meaningful uncertainty estimates is essential for a successfu...
research
04/30/2020

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

Mode connectivity provides novel geometric insights on analyzing loss la...
research
10/25/2022

Exploring Mode Connectivity for Pre-trained Language Models

Recent years have witnessed the prevalent application of pre-trained lan...
research
10/09/2019

Loss Surface Sightseeing by Multi-Point Optimization

We present multi-point optimization: an optimization technique that allo...
research
05/24/2022

Linear Connectivity Reveals Generalization Strategies

It is widely accepted in the mode connectivity literature that when two ...
