A Practical Approach to Sizing Neural Networks

10/04/2018
by   Gerald Friedland, et al.
0

Memorization is worst-case generalization. Based on MacKay's information theoretic model of supervised machine learning, this article discusses how to practically estimate the maximum size of a neural network given a training data set. First, we present four easily applicable rules to analytically determine the capacity of neural network architectures. This allows the comparison of the efficiency of different network architectures independently of a task. Second, we introduce and experimentally validate a heuristic method to estimate the neural network capacity requirement for a given dataset and labeling. This allows an estimate of the required size of a neural network for a given problem. We conclude the article with a discussion on the consequences of sizing the network wrongly, which includes both increased computation effort for training as well as reduced generalization capability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2019

Weight Agnostic Neural Networks

Not all neural network architectures are created equal, some perform muc...
research
03/26/2021

Generalization capabilities of translationally equivariant neural networks

The rising adoption of machine learning in high energy physics and latti...
research
02/12/2019

Capacity allocation analysis of neural networks: A tool for principled architecture design

Designing neural network architectures is a task that lies somewhere bet...
research
09/15/2017

Dynamic Capacity Estimation in Hopfield Networks

Understanding the memory capacity of neural networks remains a challengi...
research
05/31/2021

Persistent Homology Captures the Generalization of Neural Networks Without A Validation Set

The training of neural networks is usually monitored with a validation (...
research
08/13/2018

iNNvestigate neural networks!

In recent years, deep neural networks have revolutionized many applicati...
research
12/23/2021

Equivariance and generalization in neural networks

The crucial role played by the underlying symmetries of high energy phys...

Please sign up or login with your details

Forgot password? Click here to reset