Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rules

06/02/2022
by Yuhan Helena Liu, et al.

To unveil how the brain learns, ongoing work seeks biologically-plausible approximations of gradient descent algorithms for training recurrent neural networks (RNNs). Yet, beyond task accuracy, it is unclear whether such learning rules converge to solutions that exhibit different levels of generalization than their non-biologically-plausible counterparts. Leveraging results from deep learning theory based on loss landscape curvature, we ask: how do biologically-plausible gradient approximations affect generalization? We first demonstrate that state-of-the-art biologically-plausible learning rules for training RNNs exhibit worse and more variable generalization performance compared to their machine learning counterparts that follow the true gradient more closely. Next, we verify that such generalization performance is significantly correlated with loss landscape curvature, and we show that biologically-plausible learning rules tend to approach high-curvature regions in synaptic weight space. Using tools from dynamical systems, we derive theoretical arguments and present a theorem explaining this phenomenon. The theorem predicts our numerical results and explains why biologically-plausible rules lead to worse and more variable generalization properties. Finally, we suggest potential remedies that could be used by the brain to mitigate this effect. To our knowledge, our analysis is the first to identify the reason for this generalization gap between artificial and biologically-plausible learning rules, which can help guide future investigations into how the brain learns solutions that generalize.

research
06/27/2022

Distinguishing Learning Rules with Brain Machine Interfaces

Despite extensive theoretical work on biologically plausible learning ru...
research
11/05/2018

A Biologically Plausible Learning Rule for Deep Learning in the Brain

Researchers have proposed that deep learning, which is providing importa...
research
10/23/2020

A biologically plausible neural network for Slow Feature Analysis

Learning latent features from time series data is an important problem i...
research
06/03/2019

Learning to solve the credit assignment problem

Backpropagation is driving today's artificial neural networks (ANNs). Ho...
research
11/15/2019

Ghost Units Yield Biologically Plausible Backprop in Deep Neural Networks

In the past few years, deep learning has transformed artificial intellig...
research
05/31/2021

A remark on a paper of Krotov and Hopfield [arXiv:2008.06996]

In their recent paper titled "Large Associative Memory Problem in Neurob...
research
09/23/2019

AHA! an 'Artificial Hippocampal Algorithm' for Episodic Machine Learning

The majority of ML research concerns slow, statistical learning of i.i.d...
