Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior

10/26/2017
by Charles H. Martin, et al.

We describe an approach to understanding the peculiar and counterintuitive generalization properties of deep neural networks. The approach goes beyond the worst-case theoretical capacity control frameworks that have been popular in machine learning in recent years and revisits old ideas in the statistical mechanics of neural networks. Within this approach, we present a prototypical Very Simple Deep Learning (VSDL) model, whose behavior is governed by two control parameters: one describing an effective amount of data, or load, on the network (which decreases when noise is added to the input), and one with an effective temperature interpretation (which increases when algorithms are early stopped). Using this model, we describe how a very simple application of ideas from the statistical mechanics theory of generalization provides a strong qualitative description of recently observed empirical results, including the inability of deep neural networks to avoid overfitting training data, discontinuous learning, and sharp transitions in the generalization properties of learning algorithms.


research
06/22/2020

Good linear classifiers are abundant in the interpolating regime

Within the machine learning community, the widely-used uniform convergen...
research
12/16/2020

Using noise resilience for ranking generalization of deep neural networks

Recent papers have shown that sufficiently overparameterized neural netw...
research
06/23/2020

Statistical Mechanics of Generalization in Kernel Regression

Generalization beyond a training dataset is a main goal of machine learn...
research
05/18/2021

Deep learning for solution and inversion of structural mechanics and vibrations

Deep learning has been the most popular machine learning method in the l...
research
07/23/2021

Taxonomizing local versus global structure in neural network loss landscapes

Viewing neural network models in terms of their loss landscapes has a lo...
research
09/08/2023

Generalization Bounds: Perspectives from Information Theory and PAC-Bayes

A fundamental question in theoretical machine learning is generalization...
research
07/23/2006

Ideas by Statistical Mechanics (ISM)

Ideas by Statistical Mechanics (ISM) is a generic program to model evolu...
