Nearest Class-Center Simplification through Intermediate Layers

01/21/2022
by   Ido Ben-Shaul, et al.
3

Recent advances in theoretical Deep Learning have introduced geometric properties that occur during training, past the Interpolation Threshold – where the training error reaches zero. We inquire into the phenomena coined Neural Collapse in the intermediate layers of the networks, and emphasize the innerworkings of Nearest Class-Center Mismatch inside the deepnet. We further show that these processes occur both in vision and language model architectures. Lastly, we propose a Stochastic Variability-Simplification Loss (SVSL) that encourages better geometrical features in intermediate layers, and improves both train metrics and generalization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/18/2020

Prevalence of Neural Collapse during the terminal phase of deep learning training

Modern practice for training classification deepnets involves a Terminal...
research
09/18/2020

σ^2R Loss: a Weighted Loss by Multiplicative Factors using Sigmoidal Functions

In neural networks, the loss function represents the core of the learnin...
research
10/29/2022

Perturbation Analysis of Neural Collapse

Training deep neural networks for classification often includes minimizi...
research
03/24/2019

Generalization of k-means Related Algorithms

This article briefly introduced Arthur and Vassilvitshii's work on k-mea...
research
12/11/2017

DeepConfig: Automating Data Center Network Topologies Management with Machine Learning

In recent years, many techniques have been developed to improve the perf...
research
06/08/2022

Neural Collapse: A Review on Modelling Principles and Generalization

With a recent observation of the "Neural Collapse (NC)" phenomena by Pap...
research
09/17/2018

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing

The recent advances of hardware technology have made the intelligent ana...

Please sign up or login with your details

Forgot password? Click here to reset