Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning

11/15/2021
by Robin Schiewer, et al.

In a multi-task reinforcement learning setting, the learner commonly benefits from training on multiple related tasks by exploiting similarities among them. At the same time, the trained agent is able to solve a wider range of problems. While this benefit is well documented for model-free multi-task methods, we demonstrate a detrimental effect when a single learned dynamics model is used for multiple tasks. We therefore address the fundamental question of whether model-based multi-task reinforcement learning benefits from shared dynamics models in the same way that model-free methods benefit from shared policy networks. With a single dynamics model, we see clear evidence of task confusion and reduced performance. As a remedy, enforcing an internal structure on the learned dynamics model by training an isolated sub-network for each task notably improves performance while using the same number of parameters. We illustrate our findings by comparing both methods on a simple gridworld and on a more complex ViZDoom multi-task experiment.
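
The idea of isolated per-task sub-networks under a fixed parameter budget can be illustrated with a minimal sketch. This is not the authors' implementation: the layer sizes, the tanh activation, the explicit task index used for routing, and all names (`mlp_param_count`, `matched_hidden_width`, `ModularDynamicsModel`) are assumptions made for illustration, and no training loop is shown. The sketch only demonstrates that several small sub-networks can be sized to fit within the parameter count of one larger shared network, and that each task's prediction touches only its own weights.

```python
import math
import random


def mlp_param_count(sizes):
    """Number of weights and biases in a fully connected net with these layer sizes."""
    return sum((n_in + 1) * n_out for n_in, n_out in zip(sizes, sizes[1:]))


def matched_hidden_width(budget, n_tasks, in_dim, out_dim):
    """Largest hidden width so that n_tasks one-hidden-layer nets fit in `budget` parameters."""
    per_task = budget // n_tasks
    # A net [in_dim, h, out_dim] has h * (in_dim + 1 + out_dim) + out_dim parameters.
    return max(1, (per_task - out_dim) // (in_dim + 1 + out_dim))


def init_mlp(sizes, rng):
    """Create one (weights, biases) pair per layer with small random weights."""
    layers = []
    for n_in, n_out in zip(sizes, sizes[1:]):
        scale = 1.0 / math.sqrt(n_in)
        weights = [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
        layers.append((weights, [0.0] * n_out))
    return layers


def mlp_forward(layers, x):
    """Forward pass with tanh on hidden layers and a linear output layer."""
    for i, (weights, biases) in enumerate(layers):
        x = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
        if i < len(layers) - 1:
            x = [math.tanh(v) for v in x]
    return x


class ModularDynamicsModel:
    """One isolated sub-network per task; no weights are shared between tasks."""

    def __init__(self, n_tasks, state_dim, action_dim, hidden, seed=0):
        rng = random.Random(seed)
        sizes = [state_dim + action_dim, hidden, state_dim]
        self.subnets = [init_mlp(sizes, rng) for _ in range(n_tasks)]
        self.n_params = n_tasks * mlp_param_count(sizes)

    def predict(self, task_id, state, action):
        """Predict the next state using only the sub-network that belongs to task_id."""
        return mlp_forward(self.subnets[task_id], list(state) + list(action))


# Hypothetical sizes: 4-dimensional state, 2-dimensional action, 4 tasks.
STATE_DIM, ACTION_DIM, N_TASKS = 4, 2, 4
shared_params = mlp_param_count([STATE_DIM + ACTION_DIM, 64, STATE_DIM])
hidden = matched_hidden_width(shared_params, N_TASKS, STATE_DIM + ACTION_DIM, STATE_DIM)
model = ModularDynamicsModel(N_TASKS, STATE_DIM, ACTION_DIM, hidden)

state, action = [0.1, -0.2, 0.3, 0.0], [1.0, 0.0]
print("shared:", shared_params, "modular:", model.n_params)
print("task 0:", model.predict(0, state, action))
print("task 1:", model.predict(1, state, action))
```

Because the sub-networks share no weights, a gradient step taken on one task's data cannot change another task's predictions, which is the property the modular structure relies on to avoid interference between tasks.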


