Comparing Dynamics: Deep Neural Networks versus Glassy Systems

03/19/2018
by   M. Baity-Jesi, et al.
0

We analyze numerically the training dynamics of deep neural networks (DNN) by using methods developed in statistical physics of glassy systems. The two main issues we address are the complexity of the loss-landscape and of the dynamics within it, and to what extent DNNs share similarities with glassy systems. Our findings, obtained for different architectures and datasets, suggest that during the training process the dynamics slows down because of an increasingly large number of flat directions. At large times, when the loss is approaching zero, the system diffuses at the bottom of the landscape. Despite some similarities with the dynamics of mean-field glassy systems, in particular, the absence of barrier crossing, we find distinctive dynamical behaviors in the two cases, showing that the statistical properties of the corresponding loss and energy landscapes are different. In contrast, when the network is under-parametrized we observe a typical glassy behavior, thus suggesting the existence of different phases depending on whether the network is under-parametrized or over-parametrized.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2022

Understanding the Difficulty of Training Physics-Informed Neural Networks on Dynamical Systems

Physics-informed neural networks (PINNs) seamlessly integrate data and p...
research
05/30/2021

Embedding Principle of Loss Landscape of Deep Neural Networks

Understanding the structure of loss landscape of deep neural networks (D...
research
09/22/2020

Anomalous diffusion dynamics of learning in deep neural networks

Learning in deep neural networks (DNNs) is implemented through minimizin...
research
11/03/2019

Mean-field inference methods for neural networks

Machine learning algorithms relying on deep neural networks recently all...
research
12/16/2021

Visualizing the Loss Landscape of Winning Lottery Tickets

The underlying loss landscapes of deep neural networks have a great impa...
research
06/19/2020

An analytic theory of shallow networks dynamics for hinge loss classification

Neural networks have been shown to perform incredibly well in classifica...
research
07/18/2023

The semantic landscape paradigm for neural networks

Deep neural networks exhibit a fascinating spectrum of phenomena ranging...

Please sign up or login with your details

Forgot password? Click here to reset