Neural Collapse in the Intermediate Hidden Layers of Classification Neural Networks

08/05/2023
by   Liam Parker, et al.
0

Neural Collapse (NC) gives a precise description of the representations of classes in the final hidden layer of classification neural networks. This description provides insights into how these networks learn features and generalize well when trained past zero training error. However, to date, (NC) has only been studied in the final layer of these networks. In the present paper, we provide the first comprehensive empirical analysis of the emergence of (NC) in the intermediate hidden layers of these classifiers. We examine a variety of network architectures, activations, and datasets, and demonstrate that some degree of (NC) emerges in most of the intermediate hidden layers of the network, where the degree of collapse in any given layer is typically positively correlated with the depth of that layer in the neural network. Moreover, we remark that: (1) almost all of the reduction in intra-class variance in the samples occurs in the shallower layers of the networks, (2) the angular separation between class means increases consistently with hidden layer depth, and (3) simple datasets require only the shallower layers of the networks to fully learn them, whereas more difficult ones require the entire network. Ultimately, these results provide granular insights into the structural propagation of features through classification neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2023

Interpreting Hidden Semantics in the Intermediate Layers of 3D Point Cloud Classification Neural Network

Although 3D point cloud classification neural network models have been w...
research
12/10/2020

On the emergence of tetrahedral symmetry in the final and penultimate layers of neural network classifiers

A recent numerical study observed that neural network classifiers enjoy ...
research
05/03/2015

Making Sense of Hidden Layer Information in Deep Networks by Learning Hierarchical Targets

This paper proposes an architecture for deep neural networks with hidden...
research
10/29/2022

Perturbation Analysis of Neural Collapse

Training deep neural networks for classification often includes minimizi...
research
12/14/2021

Identifying Class Specific Filters with L1 Norm Frequency Histograms in Deep CNNs

Interpretability of Deep Neural Networks has become a major area of expl...
research
02/20/2018

i-RevNet: Deep Invertible Networks

It is widely believed that the success of deep convolutional networks is...
research
02/03/2016

Learning Discriminative Features via Label Consistent Neural Network

Deep Convolutional Neural Networks (CNN) enforces supervised information...

Please sign up or login with your details

Forgot password? Click here to reset