On the emergence of tetrahedral symmetry in the final and penultimate layers of neural network classifiers

12/10/2020
by   Weinan E, et al.
5

A recent numerical study observed that neural network classifiers enjoy a large degree of symmetry in the penultimate layer. Namely, if h(x) = Af(x) +b where A is a linear map and f is the output of the penultimate layer of the network (after activation), then all data points x_i, 1, …, x_i, N_i in a class C_i are mapped to a single point y_i by f and the points y_i are located at the vertices of a regular k-1-dimensional tetrahedron in a high-dimensional Euclidean space. We explain this observation analytically in toy models for highly expressive deep neural networks. In complementary examples, we demonstrate rigorously that even the final output of the classifier h is not uniform over data samples from a class C_i if h is a shallow network (or if the deeper layers do not bring the data samples into a convenient geometric configuration).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/05/2023

Neural Collapse in the Intermediate Hidden Layers of Classification Neural Networks

Neural Collapse (NC) gives a precise description of the representations ...
research
09/06/2017

BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks

Deep neural networks are state of the art methods for many learning task...
research
12/17/2021

A singular Riemannian geometry approach to Deep Neural Networks II. Reconstruction of 1-D equivalence classes

In a previous work, we proposed a geometric framework to study a deep ne...
research
07/05/2019

Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

The permutation symmetry of neurons in each layer of a deep neural netwo...
research
12/14/2021

Identifying Class Specific Filters with L1 Norm Frequency Histograms in Deep CNNs

Interpretability of Deep Neural Networks has become a major area of expl...
research
11/09/2015

Symmetries and control in generative neural nets

We study generative nets which can control and modify observations, afte...
research
02/22/2018

Vector Field Based Neural Networks

A novel Neural Network architecture is proposed using the mathematically...

Please sign up or login with your details

Forgot password? Click here to reset