Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning

11/10/2018
by   Chao Yu, et al.
0

Providing reinforcement learning agents with informationally rich human knowledge can dramatically improve various aspects of learning. Prior work has developed different kinds of shaping methods that enable agents to learn efficiently in complex environments. All these methods, however, tailor human guidance to agents in specialized shaping procedures, thus embodying various characteristics and advantages in different domains. In this paper, we investigate the interplay between different shaping methods for more robust learning performance. We propose an adaptive shaping algorithm which is capable of learning the most suitable shaping method in an on-line manner. Results in two classic domains verify its effectiveness from both simulated and real human studies, shedding some light on the role and impact of human factors in human-robot collaborative learning.

READ FULL TEXT
research
01/15/2017

Agent-Agnostic Human-in-the-Loop Reinforcement Learning

Providing Reinforcement Learning agents with expert advice can dramatica...
research
03/30/2023

Learning Human-to-Robot Handovers from Point Clouds

We propose the first framework to learn control policies for vision-base...
research
11/02/2020

Incorporating Rivalry in Reinforcement Learning for a Competitive Game

Recent advances in reinforcement learning with social agents have allowe...
research
03/06/2021

Reinforcement Learning, Bit by Bit

Reinforcement learning agents have demonstrated remarkable achievements ...
research
03/19/2022

Teachable Reinforcement Learning via Advice Distillation

Training automated agents to complete complex tasks in interactive envir...
research
10/02/2019

Unsupervised Doodling and Painting with Improved SPIRAL

We investigate using reinforcement learning agents as generative models ...
research
11/05/2019

A Deep Reinforcement Learning based Approach to Learning Transferable Proof Guidance Strategies

Traditional first-order logic (FOL) reasoning systems usually rely on ma...

Please sign up or login with your details

Forgot password? Click here to reset