Entropy-based Stability-Plasticity for Lifelong Learning

04/18/2022
by Vladimir Araujo, et al.

The ability to learn continuously remains elusive for deep learning models. Unlike humans, models cannot accumulate knowledge in their weights when learning new tasks, mainly because of excess plasticity and a weak incentive to reuse weights when training on a new task. To address the stability-plasticity dilemma in neural networks, we propose a novel method called Entropy-based Stability-Plasticity (ESP). Our approach dynamically decides, via a plasticity factor, how much each model layer should be modified. To determine this factor, we incorporate branch layers and an entropy-based criterion into the model. Our experiments in the natural language and vision domains show that our approach leverages prior knowledge effectively by reducing interference. In some cases, layers can also be frozen during training, which speeds up training.
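
The abstract does not give the formula for the plasticity factor, so the following is only a minimal sketch of how an entropy-based criterion over branch outputs could yield one factor per layer. Everything in it is an assumption made for illustration, not the authors' implementation: the function names, the use of normalized Shannon entropy, the direction of the mapping (a confident, low-entropy branch means the layer is kept more stable), and the hypothetical `freeze_below` threshold.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def normalized_entropy(probs):
    """Shannon entropy divided by log(num_classes), so the result is in [0, 1]."""
    n = len(probs)
    if n < 2:
        return 0.0
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(n)


def plasticity_factors(branch_logits_per_layer, freeze_below=0.05):
    """Return one plasticity factor per layer (hypothetical rule).

    branch_logits_per_layer: for each layer, a list of logit vectors
    produced by that layer's branch classifier on a batch.

    Assumption: the factor equals the mean normalized entropy of the
    branch predictions, and layers whose mean entropy falls below
    `freeze_below` get a factor of 0.0, i.e. they are frozen.
    """
    factors = []
    for batch_logits in branch_logits_per_layer:
        entropies = [normalized_entropy(softmax(l)) for l in batch_logits]
        mean_entropy = sum(entropies) / len(entropies)
        factors.append(0.0 if mean_entropy < freeze_below else mean_entropy)
    return factors


if __name__ == "__main__":
    # Layer 0: branch is maximally uncertain; layer 1: branch is very confident.
    batch = [
        [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]],
        [[100.0, 0.0, 0.0], [0.0, 100.0, 0.0]],
    ]
    print(plasticity_factors(batch))
```

Under these assumptions, each factor could scale a layer's learning rate or gradient, and a factor of zero would let that layer be skipped in the backward pass, which is one way freezing could reduce training time.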

