Capacity allocation analysis of neural networks: A tool for principled architecture design

02/12/2019
by   Jonathan Donier, et al.
0

Designing neural network architectures is a task that lies somewhere between science and art. For a given task, some architectures are eventually preferred over others, based on a mix of intuition, experience, experimentation and luck. For many tasks, the final word is attributed to the loss function, while for some others a further perceptual evaluation is necessary to assess and compare performance across models. In this paper, we introduce the concept of capacity allocation analysis, with the aim of shedding some light on what network architectures focus their modelling capacity on, when used on a given task. We focus more particularly on spatial capacity allocation, which analyzes a posteriori the effective number of parameters that a given model has allocated for modelling dependencies on a given point or region in the input space, in linear settings. We use this framework to perform a quantitative comparison between some classical architectures on various synthetic tasks. Finally, we consider how capacity allocation might translate in non-linear settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/22/2019

Capacity allocation through neural network layers

Capacity analysis has been recently introduced as a way to analyze how l...
research
10/04/2018

A Practical Approach to Sizing Neural Networks

Memorization is worst-case generalization. Based on MacKay's information...
research
03/26/2021

Generalization capabilities of translationally equivariant neural networks

The rising adoption of machine learning in high energy physics and latti...
research
10/04/2022

Polysemanticity and Capacity in Neural Networks

Individual neurons in neural networks often represent a mixture of unrel...
research
11/29/2016

Capacity and Trainability in Recurrent Neural Networks

Two potential bottlenecks on the expressiveness of recurrent neural netw...
research
08/25/2021

Opportunistic Emulation of Computationally Expensive Simulations via Deep Learning

With the underlying aim of increasing efficiency of computational modell...
research
06/11/2018

State Space Representations of Deep Neural Networks

This paper deals with neural networks as dynamical systems governed by d...

Please sign up or login with your details

Forgot password? Click here to reset