Embedding Principle of Loss Landscape of Deep Neural Networks

05/30/2021
by   Yaoyu Zhang, et al.
0

Understanding the structure of loss landscape of deep neural networks (DNNs)is obviously important. In this work, we prove an embedding principle that the loss landscape of a DNN "contains" all the critical points of all the narrower DNNs. More precisely, we propose a critical embedding such that any critical point, e.g., local or global minima, of a narrower DNN can be embedded to a critical point/hyperplane of the target DNN with higher degeneracy and preserving the DNN output function. The embedding structure of critical points is independent of loss function and training data, showing a stark difference from other nonconvex problems such as protein-folding. Empirically, we find that a wide DNN is often attracted by highly-degenerate critical points that are embedded from narrow DNNs. The embedding principle provides an explanation for the general easy optimization of wide DNNs and unravels a potential implicit low-complexity regularization during the training. Overall, our work provides a skeleton for the study of loss landscape of DNNs and its implication, by which a more exact and comprehensive understanding can be anticipated in the near

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2022

Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks

Unraveling the general structure underlying the loss landscapes of deep ...
research
11/30/2021

Embedding Principle: a hierarchical structure of loss landscape of deep neural networks

We prove a general Embedding Principle of loss landscape of deep neural ...
research
03/19/2018

Comparing Dynamics: Deep Neural Networks versus Glassy Systems

We analyze numerically the training dynamics of deep neural networks (DN...
research
04/20/2021

Solution landscape of Onsager functional identifies non-axisymmetric critical points

We investigate the solution landscapes of the Onsager free-energy functi...
research
12/18/2014

On the Stability of Deep Networks

In this work we study the properties of deep neural networks (DNN) with ...
research
10/21/2020

Deep Neural Networks Are Congestion Games: From Loss Landscape to Wardrop Equilibrium and Beyond

The theoretical analysis of deep neural networks (DNN) is arguably among...
research
08/07/2023

G-Mix: A Generalized Mixup Learning Framework Towards Flat Minima

Deep neural networks (DNNs) have demonstrated promising results in vario...

Please sign up or login with your details

Forgot password? Click here to reset