The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold

05/02/2023
by   Jialin Mao, et al.
0

We develop information-geometric techniques to analyze the trajectories of the predictions of deep networks during training. By examining the underlying high-dimensional probabilistic models, we reveal that the training process explores an effectively low-dimensional manifold. Networks with a wide range of architectures, sizes, trained using different optimization methods, regularization techniques, data augmentation techniques, and weight initializations lie on the same manifold in the prediction space. We study the details of this manifold to find that networks with different architectures follow distinguishable trajectories but other factors have a minimal influence; larger networks train along a similar manifold as that of smaller networks, just faster; and networks initialized at very different parts of the prediction space converge to the solution along a similar manifold.

READ FULL TEXT

page 18

page 23

research
02/15/2016

Efficient Representation of Low-Dimensional Manifolds using Deep Networks

We consider the ability of deep neural networks to represent data that l...
research
10/31/2022

A picture of the space of typical learnable tasks

We develop a technique to analyze representations learned by deep networ...
research
05/25/2017

Jeffrey's prior sampling of deep sigmoidal networks

Neural networks have been shown to have a remarkable ability to uncover ...
research
05/13/2023

Grasping Extreme Aerodynamics on a Low-Dimensional Manifold

Modern air vehicles perform a wide range of operations, including transp...
research
11/03/2020

Doubly Robust Off-Policy Learning on Low-Dimensional Manifolds by Deep Neural Networks

Causal inference explores the causation between actions and the conseque...
research
05/26/2023

Generalizing Adam To Manifolds For Efficiently Training Transformers

One of the primary reasons behind the success of neural networks has bee...
research
11/16/2017

LDMNet: Low Dimensional Manifold Regularized Neural Networks

Deep neural networks have proved very successful on archetypal tasks for...

Please sign up or login with your details

Forgot password? Click here to reset