We investigate how the training curve of isotropic kernel methods depend...
Two distinct limits for deep learning as the net width h→∞ have been pro...
How many training data are needed to learn a supervised task? It is ofte...
We provide a description for the evolution of the generalization perform...
We argue that in fully-connected networks a phase transition delimits th...
Deep learning has been immensely successful at a variety of tasks, rangi...