Separation of scales and a thermodynamic description of feature learning in some CNNs

12/31/2021
by   Inbar Seroussi, et al.
0

Deep neural networks (DNNs) are powerful tools for compressing and distilling information. Due to their scale and complexity, often involving billions of inter-dependent internal degrees of freedom, exact analysis approaches often fall short. A common strategy in such cases is to identify slow degrees of freedom that average out the erratic behavior of the underlying fast microscopic variables. Here, we identify such a separation of scales occurring in over-parameterized deep convolutional neural networks (CNNs) at the end of training. It implies that neuron pre-activations fluctuate in a nearly Gaussian manner with a deterministic latent kernel. While for CNNs with infinitely many channels these kernels are inert, for finite CNNs they adapt and learn from data in an analytically tractable manner. The resulting thermodynamic theory of deep learning yields accurate predictions on several deep non-linear CNN toy models. In addition, it provides new ways of analyzing and understanding CNNs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2021

A self consistent theory of Gaussian Processes captures feature learning effects in finite CNNs

Deep neural networks (DNNs) in the infinite width/channel limit have rec...
research
05/13/2021

HiDeNN-PGD: reduced-order hierarchical deep learning neural networks

This paper presents a proper generalized decomposition (PGD) based reduc...
research
09/25/2019

Information Plane Analysis of Deep Neural Networks via Matrix-Based Renyi's Entropy and Tensor Kernels

Analyzing deep neural networks (DNNs) via information plane (IP) theory ...
research
03/15/2017

A Data Driven Approach for Compound Figure Separation Using Convolutional Neural Networks

A key problem in automatic analysis and understanding of scientific pape...
research
10/22/2019

Explicitly Bayesian Regularizations in Deep Learning

Generalization is essential for deep learning. In contrast to previous w...
research
07/11/2017

RegNet: Multimodal Sensor Registration Using Deep Neural Networks

In this paper, we present RegNet, the first deep convolutional neural ne...
research
11/12/2015

When Naïve Bayes Nearest Neighbours Meet Convolutional Neural Networks

Since Convolutional Neural Networks (CNNs) have become the leading learn...

Please sign up or login with your details

Forgot password? Click here to reset