The Mean Dimension of Neural Networks – What causes the interaction effects?

07/11/2022
by Roman Hahn, et al.

Owen and Hoyt recently showed that the effective dimension offers key structural information about the input-output mapping underlying an artificial neural network. Building on this line of research, this work proposes an estimation procedure that computes the mean dimension from a given dataset, without resampling from external distributions. The design yields total indices when features are independent and a variant of total indices when features are correlated. We show that this variant possesses the zero independence property. Using synthetic datasets, we analyse how the mean dimension evolves layer by layer and how the activation function affects the magnitude of interactions. We then use the mean dimension to study some of the most widely employed convolutional architectures for image recognition (LeNet, ResNet, DenseNet). To account for pixel correlations, we propose calculating the mean dimension after adding an inverse PCA layer, which allows one to work on uncorrelated PCA-transformed features without retraining the neural network. We use the generalized total indices to produce heatmaps for post-hoc explanations, and we employ the mean dimension on the PCA-transformed features to compare artificial neural network structures. The results provide several insights into the differences in the magnitude of interactions across architectures, as well as indications of how the mean dimension evolves during training.
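For readers unfamiliar with the quantity, the mean dimension can be written as the sum of the unnormalized total indices divided by the output variance. The sketch below is illustrative only and is not the given-data procedure proposed in the paper: it assumes independent inputs that can be resampled freely and uses the standard Jansen-type estimator of the total indices. The names `mean_dimension`, `uniform_inputs`, `additive`, and `product` are hypothetical and do not come from the paper.

```python
import numpy as np


def mean_dimension(f, sample_inputs, n=20000, seed=0):
    """Monte Carlo estimate of the mean dimension of f.

    Assumes independent inputs (an assumption of this sketch).
    f maps an (n, d) array to an (n,) array.
    sample_inputs(rng, n) returns an (n, d) array of input draws.
    """
    rng = np.random.default_rng(seed)
    x = sample_inputs(rng, n)  # base sample
    z = sample_inputs(rng, n)  # independent sample used for resampling
    fx = f(x)
    variance = fx.var()

    total = 0.0
    for j in range(x.shape[1]):
        x_j = x.copy()
        x_j[:, j] = z[:, j]  # resample only feature j
        # Jansen-type estimator of the unnormalized total index of feature j
        total += 0.5 * np.mean((fx - f(x_j)) ** 2)

    # Sum of total indices divided by the output variance
    return total / variance


def uniform_inputs(d):
    """Return a sampler of d independent U(-1, 1) features."""
    return lambda rng, n: rng.uniform(-1.0, 1.0, size=(n, d))


def additive(x):
    # No interactions: the mean dimension should be close to 1
    return x.sum(axis=1)


def product(x):
    # Pure two-way interaction: the mean dimension should be close to 2
    return x[:, 0] * x[:, 1]


if __name__ == "__main__":
    print(mean_dimension(additive, uniform_inputs(3)))
    print(mean_dimension(product, uniform_inputs(2)))
```

In this toy setting, an additive function gives an estimate near 1 and a pure product of two centered features gives an estimate near 2, so values above 1 indicate that interactions contribute to the output variance.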

research
02/05/2018

Artificial neural network based modelling approach for municipal solid waste gasification in a fluidized bed reactor

In this paper, multi-layer feed forward neural networks are used to pred...
research
08/01/2023

Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes

To gain a deeper understanding of the behavior and learning dynamics of ...
research
10/16/2019

Structural Analysis of Sparse Neural Networks

Sparse Neural Networks regained attention due to their potential for mat...
research
07/02/2020

Efficient estimation of the ANOVA mean dimension, with an application to neural net classification

The mean dimension of a black box function of d variables is a convenien...
research
09/11/2018

Deep Asymmetric Networks with a Set of Node-wise Variant Activation Functions

This work presents deep asymmetric networks with a set of node-wise vari...
