Preventing Deterioration of Classification Accuracy in Predictive Coding Networks

08/15/2022
by   Paul F. Kinghorn, et al.
0

Predictive Coding Networks (PCNs) aim to learn a generative model of the world. Given observations, this generative model can then be inverted to infer the causes of those observations. However, when training PCNs, a noticeable pathology is often observed where inference accuracy peaks and then declines with further training. This cannot be explained by overfitting since both training and test accuracy decrease simultaneously. Here we provide a thorough investigation of this phenomenon and show that it is caused by an imbalance between the speeds at which the various layers of the PCN converge. We demonstrate that this can be prevented by regularising the weight matrices at each layer: by restricting the relative size of matrix singular values, we allow the weight matrix to change but restrict the overall impact which a layer can have on its neighbours. We also demonstrate that a similar effect can be achieved through a more biologically plausible and simple scheme of just capping the weights.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2023

Deep Learning Weight Pruning with RMT-SVD: Increasing Accuracy and Reducing Overfitting

In this work, we present some applications of random matrix theory for t...
research
11/13/2020

Investigating Learning in Deep Neural Networks using Layer-Wise Weight Change

Understanding the per-layer learning dynamics of deep neural networks is...
research
10/26/2019

Making Predictive Coding Networks Generative

Predictive coding (PC) networks are a biologically interesting class of ...
research
03/14/2021

Pre-interpolation loss behaviour in neural networks

When training neural networks as classifiers, it is common to observe an...
research
02/03/2020

Effect of top-down connections in Hierarchical Sparse Coding

Hierarchical Sparse Coding (HSC) is a powerful model to efficiently repr...
research
11/09/2015

Biologically Inspired Dynamic Textures for Probing Motion Perception

Perception is often described as a predictive process based on an optima...

Please sign up or login with your details

Forgot password? Click here to reset