Complex Critical Points of Deep Linear Neural Networks

01/30/2023
by   Ayush Bharadwaj, et al.
0

We extend the work of Mehta, Chen, Tang, and Hauenstein on computing the complex critical points of the loss function of deep linear neutral networks when the activation function is the identity function. For networks with a single hidden layer trained on a single data point we give an improved bound on the number of complex critical points of the loss function. We show that for any number of hidden layers complex critical points with zero coordinates arise in certain patterns which we completely classify for networks with one hidden layer. We report our results of computational experiments with varying network architectures defining small deep linear networks using HomotopyContinuation.jl.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2019

Pure and Spurious Critical Points: a Geometric Study of Linear Networks

The critical locus of the loss function of a neural network is determine...
research
07/26/2023

Understanding Deep Neural Networks via Linear Separability of Hidden Layers

In this paper, we measure the linear separability of hidden layer output...
research
10/30/2017

Critical Points of Neural Networks: Analytical Forms and Landscape Properties

Due to the success of deep learning to solving a variety of challenging ...
research
07/05/2019

Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

The permutation symmetry of neurons in each layer of a deep neural netwo...
research
06/14/2019

Empirical study of extreme overfitting points of neural networks

In this paper we propose a method of obtaining points of extreme overfit...
research
04/12/2023

Function Space and Critical Points of Linear Convolutional Networks

We study the geometry of linear networks with one-dimensional convolutio...
research
09/04/2022

ASTra: A Novel Algorithm-Level Approach to Imbalanced Classification

We propose a novel output layer activation function, which we name ASTra...

Please sign up or login with your details

Forgot password? Click here to reset