Understanding plasticity in neural networks

03/02/2023
by Clare Lyle, et al.

Plasticity, the ability of a neural network to quickly change its predictions in response to new information, is essential for the adaptability and robustness of deep reinforcement learning systems. Deep neural networks are known to lose plasticity over the course of training even in relatively simple learning problems, but the mechanisms driving this phenomenon are still poorly understood. This paper conducts a systematic empirical analysis of plasticity loss, with the goal of understanding the phenomenon mechanistically in order to guide the future development of targeted solutions. We find that loss of plasticity is deeply connected to changes in the curvature of the loss landscape, but that it typically occurs in the absence of saturated units or divergent gradient norms. Based on this insight, we identify a number of parameterization and optimization design choices which enable networks to better preserve plasticity over the course of training. We validate the utility of these findings in larger-scale learning problems by applying the best-performing intervention, layer normalization, to a deep RL agent trained on the Arcade Learning Environment.
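For readers unfamiliar with the intervention named above, the following is a minimal sketch of layer normalization applied to a single vector of pre-activations. It is not the authors' implementation: the function names (`layer_norm`, `relu`), the `eps` value, and the example inputs are illustrative assumptions. The sketch only shows the standard operation, which rescales each example's features to zero mean and unit variance before the nonlinearity.

```python
import math


def layer_norm(x, gain=None, bias=None, eps=1e-5):
    """Normalize one feature vector to zero mean and (approximately) unit variance.

    `gain` and `bias` stand in for the learnable per-feature scale and shift;
    they default to 1 and 0 here (an assumption made for illustration).
    """
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n
    inv_std = 1.0 / math.sqrt(var + eps)
    gain = gain if gain is not None else [1.0] * n
    bias = bias if bias is not None else [0.0] * n
    return [g * (v - mean) * inv_std + b for v, g, b in zip(x, gain, bias)]


def relu(x):
    """Elementwise rectified linear unit."""
    return [max(0.0, v) for v in x]


# Hypothetical large-magnitude pre-activations, chosen only for illustration.
pre_activations = [120.0, -340.0, 55.0, -10.0]
normalized = layer_norm(pre_activations)
hidden = relu(normalized)
```

In this sketch the normalized vector has the same scale regardless of how large the raw pre-activations are, which is the property the operation provides by construction; whether and why that helps preserve plasticity is the subject of the paper itself.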


