Overcome Anterograde Forgetting with Cycled Memory Networks

12/04/2021
by Jian Peng, et al.

Learning from a sequence of tasks over a lifetime is essential for an agent on the path toward artificial general intelligence. This requires the agent to continuously learn and memorize new knowledge without interference. This paper first demonstrates a fundamental issue of lifelong learning with neural networks, named anterograde forgetting: preserving and transferring memory may inhibit the learning of new knowledge. This is attributed to two facts: the learning capacity of a neural network shrinks as it keeps memorizing historical knowledge, and conceptual confusion may occur when it transfers irrelevant old knowledge to the current task. This work proposes a general framework named Cycled Memory Networks (CMN) to address anterograde forgetting in neural networks for lifelong learning. The CMN consists of two individual memory networks that store short-term and long-term memories to avoid capacity shrinkage. A transfer cell connects these two memory networks, enabling knowledge transfer from the long-term memory network to the short-term memory network to mitigate conceptual confusion, and a memory consolidation mechanism integrates short-term knowledge into the long-term memory network for knowledge accumulation. Experimental results demonstrate that the CMN can effectively address anterograde forgetting on several task-related, task-conflict, class-incremental, and cross-domain benchmarks.
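
To make the architecture in the abstract concrete, the following is a minimal sketch of its three pieces: a long-term memory network, a short-term memory network, and a transfer cell between them, plus a consolidation step. It is an illustration under stated assumptions, not the authors' implementation. The class name `CycledMemorySketch`, the single-layer networks, the sigmoid gate used as the transfer cell, and the weight-interpolation rule used for consolidation are all hypothetical stand-ins; the abstract does not specify any of these details.

```python
import math
import random


def linear(weights, x):
    """Multiply a weight matrix (a list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def init_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class CycledMemorySketch:
    """Hypothetical toy model: two one-layer memory networks and a gate."""

    def __init__(self, in_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        # Separate parameters for long-term and short-term memory, so that
        # learning a new task does not consume long-term capacity.
        self.long_term = init_matrix(hidden_dim, in_dim, rng)
        self.short_term = init_matrix(hidden_dim, in_dim, rng)
        # Assumed form of the transfer cell: a gate over long-term features.
        self.gate = init_matrix(hidden_dim, hidden_dim, rng)

    def transfer(self, x):
        """Gate long-term features before they reach the short-term network."""
        h_long = [math.tanh(v) for v in linear(self.long_term, x)]
        gate = [sigmoid(v) for v in linear(self.gate, h_long)]
        # A gate value near 0 suppresses old knowledge that is irrelevant here.
        return [g * h for g, h in zip(gate, h_long)]

    def short_term_forward(self, x):
        """Short-term features for the current task, plus gated transfer."""
        h_short = linear(self.short_term, x)
        transferred = self.transfer(x)
        return [math.tanh(s + t) for s, t in zip(h_short, transferred)]

    def consolidate(self, rate=0.5):
        """Assumed consolidation rule: move long-term weights toward the
        short-term weights by linear interpolation."""
        self.long_term = [
            [(1.0 - rate) * l + rate * s for l, s in zip(l_row, s_row)]
            for l_row, s_row in zip(self.long_term, self.short_term)
        ]


net = CycledMemorySketch(in_dim=3, hidden_dim=4, seed=1)
features = net.short_term_forward([0.2, -0.1, 0.5])
net.consolidate(rate=0.5)
```

In this sketch the cycle is: the long-term network informs the short-term network through `transfer`, the short-term network would be trained on the current task, and `consolidate` then folds what was learned back into long-term memory. Training code is omitted because the abstract gives no loss or optimization details.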


