Modular Lifelong Reinforcement Learning via Neural Composition

07/01/2022
by   Jorge A. Mendez, et al.
0

Humans commonly solve complex problems by decomposing them into easier subproblems and then combining the subproblem solutions. This type of compositional reasoning permits reuse of the subproblem solutions when tackling future tasks that share part of the underlying compositional structure. In a continual or lifelong reinforcement learning (RL) setting, this ability to decompose knowledge into reusable components would enable agents to quickly learn new RL tasks by leveraging accumulated compositional structures. We explore a particular form of composition based on neural modules and present a set of RL problems that intuitively admit compositional solutions. Empirically, we demonstrate that neural composition indeed captures the underlying structure of this space of problems. We further propose a compositional lifelong RL method that leverages accumulated neural components to accelerate the learning of future tasks while retaining performance on previous tasks via off-line RL over replayed experiences.

READ FULL TEXT

page 15

page 16

research
07/25/2022

Lifelong Machine Learning of Functionally Compositional Structures

A hallmark of human intelligence is the ability to construct self-contai...
research
07/13/2023

Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning

Offline reinforcement learning (RL) is a promising direction that allows...
research
07/08/2022

CompoSuite: A Compositional Reinforcement Learning Benchmark

We present CompoSuite, an open-source simulated robotic manipulation ben...
research
06/18/2019

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Solving complex, temporally-extended tasks is a long-standing problem in...
research
08/29/2022

Categorical semantics of compositional reinforcement learning

Reinforcement learning (RL) often requires decomposing a problem into su...
research
07/15/2020

Lifelong Learning of Compositional Structures

A hallmark of human intelligence is the ability to construct self-contai...
research
02/19/2023

Compositionality and Bounds for Optimal Value Functions in Reinforcement Learning

An agent's ability to reuse solutions to previously solved problems is c...

Please sign up or login with your details

Forgot password? Click here to reset