Continual Learning with Recursive Gradient Optimization

01/29/2022
by Hao Liu, et al.

Learning multiple tasks sequentially without forgetting previous knowledge, called Continual Learning (CL), remains a long-standing challenge for neural networks. Most existing methods rely on additional network capacity or data replay. In contrast, we introduce a novel approach which we refer to as Recursive Gradient Optimization (RGO). RGO is composed of an iteratively updated optimizer that modifies the gradient to minimize forgetting without data replay and a virtual Feature Encoding Layer (FEL) that represents different long-term structures with only task descriptors. Experiments demonstrate that RGO performs significantly better than the baselines on popular continual classification benchmarks and achieves new state-of-the-art performance on 20-split-CIFAR100 (82.22%). With higher average accuracy than Single-Task Learning (STL), this method is flexible and reliable in providing continual learning capabilities for learning models that rely on gradient descent.
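
The abstract does not state RGO's update rule, so the sketch below is only a generic illustration of what "modifying the gradient to limit forgetting" can look like. It uses a plain orthogonal projection, which is a swapped-in stand-in and not the recursive optimizer proposed in the paper. The function name, the `protected` matrix, and the random data are all hypothetical.

```python
import numpy as np

def project_gradient(grad, protected):
    """Remove from `grad` its components along the columns of `protected`.

    Illustrative stand-in only: a simple orthogonal projection,
    NOT the RGO update rule (which the abstract does not specify).
    """
    # Orthonormalise the protected directions so the projection is exact.
    q, _ = np.linalg.qr(protected)
    return grad - q @ (q.T @ grad)

rng = np.random.default_rng(0)
protected = rng.normal(size=(5, 2))  # hypothetical directions that matter for old tasks
grad = rng.normal(size=5)            # hypothetical gradient of the new task's loss

modified = project_gradient(grad, protected)
```

A step taken along `modified` leaves the parameters unchanged along the protected directions, which is the intuition behind replay-free, gradient-modifying approaches. How RGO actually constructs and recursively updates its modifier is described in the full text, not here.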


research
08/04/2022

A Benchmark and Empirical Analysis for Replay Strategies in Continual Learning

With the capacity of continual learning, humans can continuously acquire...
research
08/08/2023

Improving Performance in Continual Learning Tasks using Bio-Inspired Architectures

The ability to learn continuously from an incoming data stream without c...
research
01/29/2022

Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System

Humans excel at continually learning from an ever-changing environment w...
research
03/17/2021

Gradient Projection Memory for Continual Learning

The ability to learn continually without forgetting the past tasks is a ...
research
08/30/2023

Introducing Language Guidance in Prompt-based Continual Learning

Continual Learning aims to learn a single model on a sequence of tasks w...
research
03/03/2022

Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning

While lifelong SLAM addresses the capability of a robot to adapt to chan...
research
06/19/2023

Partial Hypernetworks for Continual Learning

Hypernetworks mitigate forgetting in continual learning (CL) by generati...
