Action Advising with Advice Imitation in Deep Reinforcement Learning

04/17/2021
by   Ercüment İlhan, et al.
0

Action advising is a peer-to-peer knowledge exchange technique built on the teacher-student paradigm to alleviate the sample inefficiency problem in deep reinforcement learning. Recently proposed student-initiated approaches have obtained promising results. However, due to being in the early stages of development, these also have some substantial shortcomings. One of the abilities that are absent in the current methods is further utilising advice by reusing, which is especially crucial in the practical settings considering the budget and cost constraints in peer-to-peer. In this study, we present an approach to enable the student agent to imitate previously acquired advice to reuse them directly in its exploration policy, without any interventions in the learning mechanism itself. In particular, we employ a behavioural cloning module to imitate the teacher policy and use dropout regularisation to have a notion of epistemic uncertainty to keep track of which state-advice pairs are actually collected. As the results of experiments we conducted in three Atari games show, advice reusing via generalisation is indeed a feasible option in deep RL and our approach can successfully achieve this while significantly improving the learning performance, even when paired with a simple early advising heuristic.

READ FULL TEXT
research
10/01/2020

Student-Initiated Action Advising via Advice Novelty

Action advising is a knowledge exchange mechanism between peers, namely ...
research
04/17/2021

Learning on a Budget via Teacher Imitation

Deep Reinforcement Learning (RL) techniques can benefit greatly from lev...
research
04/19/2019

Teaching on a Budget in Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning algorithms can solve complex sequential deci...
research
04/14/2022

Methodical Advice Collection and Reuse in Deep Reinforcement Learning

Reinforcement learning (RL) has shown great success in solving many chal...
research
03/10/2018

Kickstarting Deep Reinforcement Learning

We present a method for using previously-trained 'teacher' agents to kic...
research
02/06/2020

Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach

Peer-to-peer knowledge transfer in distributed environments has emerged ...
research
07/28/2017

Learning to Teach Reinforcement Learning Agents

In this article we study the transfer learning model of action advice un...

Please sign up or login with your details

Forgot password? Click here to reset