Overcoming catastrophic forgetting with hard attention to the task

01/04/2018
by   Joan Serrà, et al.
0

Catastrophic forgetting occurs when a neural network loses the information learned with the first task, after training on a second task. This problem remains a hurdle for general artificial intelligence systems with sequential learning capabilities. In this paper, we propose a task-based hard attention mechanism that preserves previous tasks' information without substantially affecting the current task's learning. An attention mask is learned concurrently to every task through stochastic gradient descent, and previous masks are exploited to constrain such learning. We show that the proposed mechanism is effective for reducing catastrophic forgetting, cutting current rates by 33 to 84 choices and that it offers a number of monitoring capabilities. The approach features the possibility to control both the stability and compactness of the learned knowledge, which we believe makes it also attractive for online learning and network compression applications.

READ FULL TEXT
research
08/09/2021

Some thoughts on catastrophic forgetting and how to learn an algorithm

The work of McCloskey and Cohen popularized the concept of catastrophic ...
research
04/12/2018

Combating catastrophic forgetting with developmental compression

Generally intelligent agents exhibit successful behavior across problems...
research
09/23/2021

The Role of Bio-Inspired Modularity in General Learning

One goal of general intelligence is to learn novel information without o...
research
05/18/2018

Overcoming catastrophic forgetting problem by weight consolidation and long-term memory

Sequential learning of multiple tasks in artificial neural networks usin...
research
04/05/2019

Reducing catastrophic forgetting when evolving neural networks

A key stepping stone in the development of an artificial general intelli...
research
07/31/2019

Overcoming Catastrophic Forgetting by Neuron-level Plasticity Control

To address the issue of catastrophic forgetting in neural networks, we p...
research
04/06/2017

Encoder Based Lifelong Learning

This paper introduces a new lifelong learning solution where a single mo...

Please sign up or login with your details

Forgot password? Click here to reset