Learning Structures for Deep Neural Networks

05/27/2021
by   Jinhui Yuan, et al.
0

In this paper, we focus on the unsupervised setting for structure learning of deep neural networks and propose to adopt the efficient coding principle, rooted in information theory and developed in computational neuroscience, to guide the procedure of structure learning without label information. This principle suggests that a good network structure should maximize the mutual information between inputs and outputs, or equivalently maximize the entropy of outputs under mild assumptions. We further establish connections between this principle and the theory of Bayesian optimal classification, and empirically verify that larger entropy of the outputs of a deep neural network indeed corresponds to a better classification accuracy. Then as an implementation of the principle, we show that sparse coding can effectively maximize the entropy of the output signals, and accordingly design an algorithm based on global group sparse coding to automatically learn the inter-layer connection and determine the depth of a neural network. Our experiments on a public image classification dataset demonstrate that using the structure learned from scratch by our proposed algorithm, one can achieve a classification accuracy comparable to the best expert-designed structure (i.e., convolutional neural networks (CNN)). In addition, our proposed algorithm successfully discovers the local connectivity (corresponding to local receptive fields in CNN) and invariance structure (corresponding to pulling in CNN), as well as achieves a good tradeoff between marginal performance gain and network depth.

READ FULL TEXT
research
06/24/2018

Constructing Deep Neural Networks by Bayesian Network Structure Learning

We introduce a principled approach for unsupervised structure learning o...
research
11/29/2017

Modeling Information Flow Through Deep Neural Networks

This paper proposes a principled information theoretic analysis of class...
research
04/20/2021

BraidNet: procedural generation of neural networks for image classification problems using braid theory

In this article, we propose the approach to procedural optimization of a...
research
03/26/2021

Explore the Knowledge contained in Network Weights to Obtain Sparse Neural Networks

Sparse neural networks are important for achieving better generalization...
research
05/25/2022

Entropy Maximization with Depth: A Variational Principle for Random Neural Networks

To understand the essential role of depth in neural networks, we investi...
research
08/17/2018

Dynamic Routing on Deep Neural Network for Thoracic Disease Classification and Sensitive Area Localization

We present and evaluate a new deep neural network architecture for autom...
research
09/07/2021

A Biologically Plausible Learning Rule for Perceptual Systems of organisms that Maximize Mutual Information

It is widely believed that the perceptual system of an organism is optim...

Please sign up or login with your details

Forgot password? Click here to reset