Interactive Reinforcement Learning with Dynamic Reuse of Prior Knowledge from Human/Agent's Demonstration

05/11/2018
by   Zhaodong Wang, et al.
0

Reinforcement learning has enjoyed multiple successes in recent years. However, these successes typically require very large amounts of data before an agent achieves acceptable performance. This paper introduces a novel way of combating such requirements by leveraging existing (human or agent) knowledge. In particular, this paper uses demonstrations from agents and humans, allowing an untrained agent to quickly achieve high performance. We empirically compare with, and highlight the weakness of, HAT and CHAT, methods of transferring knowledge from a source agent/human to a target agent. This paper introduces an effective transfer approach, DRoP, combining the offline knowledge (demonstrations recorded before learning) with online confidence-based performance analysis. DRoP dynamically involves the demonstrator's knowledge, integrating it into the reinforcement learning agent's online learning loop to achieve efficient and robust learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2022

Lifelong Inverse Reinforcement Learning

Methods for learning from demonstration (LfD) have shown success in acqu...
research
04/12/2020

Reinforcement Learning via Reasoning from Demonstration

Demonstration is an appealing way for humans to provide assistance to re...
research
06/22/2018

Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning

Humans are able to understand and perform complex tasks by strategically...
research
06/16/2020

Accelerating Online Reinforcement Learning with Offline Datasets

Reinforcement learning provides an appealing formalism for learning cont...
research
10/04/2021

Learning to Assist Agents by Observing Them

The ability of an AI agent to assist other agents, such as humans, is an...
research
09/21/2019

Leveraging Human Guidance for Deep Reinforcement Learning Tasks

Reinforcement learning agents can learn to solve sequential decision tas...
research
06/13/2012

Knowledge Combination in Graphical Multiagent Model

A graphical multiagent model (GMM) represents a joint distribution over ...

Please sign up or login with your details

Forgot password? Click here to reset