Building a Subspace of Policies for Scalable Continual Learning

11/18/2022
by   Jean-Baptiste Gaya, et al.
0

The ability to continuously acquire new knowledge and skills is crucial for autonomous agents. Existing methods are typically based on either fixed-size models that struggle to learn a large number of diverse behaviors, or growing-size models that scale poorly with the number of tasks. In this work, we aim to strike a better balance between an agent's size and performance by designing a method that grows adaptively depending on the task sequence. We introduce Continual Subspace of Policies (CSP), a new approach that incrementally builds a subspace of policies for training a reinforcement learning agent on a sequence of tasks. The subspace's high expressivity allows CSP to perform well for many different tasks while growing sublinearly with the number of tasks. Our method does not suffer from forgetting and displays positive transfer to new tasks. CSP outperforms a number of popular baselines on a wide range of scenarios from two challenging domains, Brax (locomotion) and Continual World (manipulation).

READ FULL TEXT

page 18

page 19

page 30

research
05/23/2021

Continual World: A Robotic Benchmark For Continual Reinforcement Learning

Continual learning (CL) – the ability to continuously learn, building on...
research
02/22/2018

Unicorn: Continual Learning with a Universal, Off-policy Agent

Some real-world domains are best characterized as a single task, but for...
research
10/15/2019

Compacting, Picking and Growing for Unforgetting Continual Learning

Continual lifelong learning is essential to many applications. In this p...
research
10/19/2021

A Simple Approach to Continual Learning by Transferring Skill Parameters

In order to be effective general purpose machines in real world environm...
research
09/28/2022

Disentangling Transfer in Continual Reinforcement Learning

The ability of continual learning systems to transfer knowledge from pre...
research
09/25/2020

Continual Model-Based Reinforcement Learning with Hypernetworks

Effective planning in model-based reinforcement learning (MBRL) and mode...
research
10/23/2020

A Combinatorial Perspective on Transfer Learning

Human intelligence is characterized not only by the capacity to learn co...

Please sign up or login with your details

Forgot password? Click here to reset