Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control

10/15/2020
by Zhiyuan Xu, et al.

While Deep Reinforcement Learning (DRL) has emerged as a promising approach to many complex tasks, it remains challenging to train a single DRL agent that is capable of undertaking multiple different continuous control tasks. In this paper, we present a Knowledge Transfer based Multi-task Deep Reinforcement Learning framework (KTM-DRL) for continuous control, which enables a single DRL agent to achieve expert-level performance in multiple different tasks by learning from task-specific teachers. In KTM-DRL, the multi-task agent first leverages an offline knowledge transfer algorithm, designed specifically for the actor-critic architecture, to quickly learn a control policy from the experience of task-specific teachers. It then employs an online learning algorithm to further improve itself by learning from new online transition samples under the guidance of those teachers. We perform a comprehensive empirical study with two commonly used benchmarks in the MuJoCo continuous control task suite. The experimental results demonstrate the effectiveness of KTM-DRL and of its knowledge transfer and online learning algorithms, and show that it outperforms the state of the art by a large margin.
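
To make the offline stage more concrete, the following is a minimal toy sketch of the general idea of transferring knowledge from task-specific teachers to one multi-task student. It is not the KTM-DRL algorithm: the abstract does not give its loss functions or network design, so everything here is an assumption made for illustration. The two linear "teachers", the task-conditioned feature map, and the names `TEACHERS`, `features`, `distill_offline`, and `max_imitation_error` are all hypothetical, and only the actor side is imitated (by plain action regression), whereas the paper's method targets the full actor-critic architecture and adds an online stage.

```python
import random

# Hypothetical stand-ins for two pre-trained, task-specific teacher policies.
# Each maps a 2-dimensional state to a scalar action.
TEACHERS = [
    lambda s: 0.5 * s[0] - 0.2 * s[1],   # teacher for task 0
    lambda s: -0.3 * s[0] + 0.8 * s[1],  # teacher for task 1
]


def features(state, task_id, n_tasks=2):
    """Task-conditioned features: copy the state into the slot of its task."""
    phi = [0.0] * (len(state) * n_tasks)
    for i, x in enumerate(state):
        phi[task_id * len(state) + i] = x
    return phi


def student_action(weights, state, task_id):
    """Single student policy: one weight vector shared across all tasks."""
    return sum(w * f for w, f in zip(weights, features(state, task_id)))


def distill_offline(steps=5000, lr=0.1, seed=0):
    """Fit the student by regressing onto teacher actions (squared error)."""
    rng = random.Random(seed)
    weights = [0.0] * 4
    for _ in range(steps):
        task_id = rng.randrange(len(TEACHERS))
        # Assumption: states are sampled uniformly here; in practice they
        # would come from the teachers' stored experience.
        state = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        error = student_action(weights, state, task_id) - TEACHERS[task_id](state)
        phi = features(state, task_id)
        # Stochastic gradient step on 0.5 * error ** 2.
        weights = [w - lr * error * f for w, f in zip(weights, phi)]
    return weights


def max_imitation_error(weights, n=200, seed=1):
    """Largest gap between student and teacher actions on fresh states."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(n):
        state = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        for task_id, teacher in enumerate(TEACHERS):
            gap = abs(student_action(weights, state, task_id) - teacher(state))
            worst = max(worst, gap)
    return worst


if __name__ == "__main__":
    w = distill_offline()
    print("max imitation error:", max_imitation_error(w))
```

In this toy setting the student can represent both teachers exactly, so imitation alone is enough. The online stage described in the abstract, where the agent keeps improving from new transition samples under teacher guidance, is not covered by this sketch.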

research
03/24/2023

Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition

Deep reinforcement learning (DRL) frameworks are increasingly used to so...
research
02/17/2022

VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning

We propose a simple but powerful data-driven framework for solving highl...
research
10/06/2021

On The Transferability of Deep-Q Networks

Transfer Learning (TL) is an efficient machine learning paradigm that al...
research
09/16/2019

Deep Reinforcement Learning for Task-driven Discovery of Incomplete Networks

Complex networks are often either too large for full exploration, partia...
research
05/20/2020

Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks

Model-based Vol/VAR optimization method is widely used to eliminate volt...
research
02/09/2021

Measuring Progress in Deep Reinforcement Learning Sample Efficiency

Sampled environment transitions are a critical input to deep reinforceme...
research
12/24/2020

Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search

AlphaGo's astonishing performance has ignited an explosive interest in d...
