Modeling Information Flow Through Deep Neural Networks

11/29/2017
by   Ahmad Chaddad, et al.
0

This paper proposes a principled information theoretic analysis of classification for deep neural network structures, e.g. convolutional neural networks (CNN). The output of convolutional filters is modeled as a random variable Y conditioned on the object class C and network filter bank F. The conditional entropy (CENT) H(Y |C,F) is shown in theory and experiments to be a highly compact and class-informative code, that can be computed from the filter outputs throughout an existing CNN and used to obtain higher classification results than the original CNN itself. Experiments demonstrate the effectiveness of CENT feature analysis in two separate CNN classification contexts. 1) In the classification of neurodegeneration due to Alzheimer's disease (AD) and natural aging from 3D magnetic resonance image (MRI) volumes, 3 CENT features result in an AUC=94.6 on the public OASIS dataset used and 12 original CNN trained for the task. 2) In the context of visual object classification from 2D photographs, transfer learning based on a small set of CENT features identified throughout an existing CNN leads to AUC values comparable to the 1000-feature softmax output of the original network when classifying previously unseen object categories. The general information theoretical analysis explains various recent CNN design successes, e.g. densely connected CNN architectures, and provides insights for future research directions in deep learning.

READ FULL TEXT

page 4

page 5

research
05/27/2021

Learning Structures for Deep Neural Networks

In this paper, we focus on the unsupervised setting for structure learni...
research
10/09/2018

Convolutional Neural Networks In Convolution

Currently, increasingly deeper neural networks have been applied to impr...
research
01/12/2016

Using Filter Banks in Convolutional Neural Networks for Texture Classification

Deep learning has established many new state of the art solutions in the...
research
11/29/2017

Towards Alzheimer's Disease Classification through Transfer Learning

Detection of Alzheimer's Disease (AD) from neuroimaging data such as MRI...
research
09/21/2020

Improving Automated COVID-19 Grading with Convolutional Neural Networks in Computed Tomography Scans: An Ablation Study

Amidst the ongoing pandemic, several studies have shown that COVID-19 cl...
research
08/17/2018

Dynamic Routing on Deep Neural Network for Thoracic Disease Classification and Sensitive Area Localization

We present and evaluate a new deep neural network architecture for autom...
research
09/07/2017

Improving Sonar Image Patch Matching via Deep Learning

Matching sonar images with high accuracy has been a problem for a long t...

Please sign up or login with your details

Forgot password? Click here to reset