Understanding Deep Neural Networks via Linear Separability of Hidden Layers

07/26/2023
by Chao Zhang, et al.

In this paper, we measure the linear separability of hidden layer outputs to study the characteristics of deep neural networks. First, we propose Minkowski difference based linear separability measures (MD-LSMs) to evaluate the degree of linear separability of two point sets. We then demonstrate a synchronicity between the linear separability of hidden layer outputs and the network's training performance: if the updated weights enhance the linear separability of hidden layer outputs, the updated network achieves better training performance, and vice versa. We also study the effect of the activation function and of network size (both width and depth) on the linear separability of hidden layers. Finally, we conduct numerical experiments to validate our findings on several popular deep networks, including multilayer perceptron (MLP), convolutional neural network (CNN), deep belief network (DBN), ResNet, VGGNet, AlexNet, vision transformer (ViT) and GoogLeNet.
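The idea behind a Minkowski difference based measure can be illustrated with a minimal sketch. It is not the paper's MD-LSM definition, which the abstract does not spell out. It relies only on a standard fact: two finite point sets A and B are linearly separable exactly when some direction w has a positive inner product with every difference a - b. The function names, the toy data, and the choice of w (the difference of the class means, not an optimized direction) are all assumptions made for illustration.

```python
import numpy as np

def minkowski_difference(A, B):
    """Return all pairwise differences a - b as an array of shape (len(A) * len(B), dim)."""
    return (A[:, None, :] - B[None, :, :]).reshape(-1, A.shape[1])

def separability_score(A, B, w):
    """Fraction of difference vectors a - b with a strictly positive projection on w.

    A score of 1.0 means the hyperplane with normal w separates A from B.
    This is an illustrative score, not the measure defined in the paper.
    """
    M = minkowski_difference(A, B)
    return float(np.mean(M @ w > 0))

# Two toy point sets in the plane (hypothetical data)
A = np.array([[2.0, 2.0], [3.0, 1.0], [2.5, 3.0]])
B = np.array([[-1.0, 0.0], [0.0, -2.0], [-2.0, -1.0]])

# Heuristic direction: difference of the set means (an assumption, not optimized)
w = A.mean(axis=0) - B.mean(axis=0)

score = separability_score(A, B, w)       # 1.0 for these toy sets
overlap = separability_score(A, A, w)     # below 1.0: a set cannot be separated from itself
print(score, overlap)
```

In the setting the abstract describes, A and B would be the outputs of a hidden layer for two classes, and a score of this kind could be tracked across layers or training steps. A real measure would search over directions w rather than fix one heuristically.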

research
01/30/2023

Complex Critical Points of Deep Linear Neural Networks

We extend the work of Mehta, Chen, Tang, and Hauenstein on computing the...
research
06/09/2023

Hidden Classification Layers: a study on Data Hidden Representations with a Higher Degree of Linear Separability between the Classes

In the context of classification problems, Deep Learning (DL) approaches...
research
12/09/2015

Gamma Belief Networks

To infer multilayer deep representations of high-dimensional discrete an...
research
09/17/2018

Self Configuration in Machine Learning

In this paper we first present a class of algorithms for training multi-...
research
07/09/2017

Deepest Neural Networks

This paper shows that a long chain of perceptrons (that is, a multilayer...
research
12/04/2022

Understanding Sinusoidal Neural Networks

In this work, we investigate the representation capacity of multilayer p...
research
05/20/2018

Improved Learning of One-hidden-layer Convolutional Neural Networks with Overlaps

We propose a new algorithm to learn a one-hidden-layer convolutional neu...
