Human Apprenticeship Learning via Kernel-based Inverse Reinforcement Learning

02/25/2020
by   Mark A. Rucker, et al.
0

This paper considers if a reward function learned via inverse reinforcement from a human expert can be used as a feedback intervention to alter future human performance as desired (i.e., human to human apprenticeship learning). To learn reward functions two new algorithms are developed: a kernel-based inverse reinforcement learning algorithm and a Monte Carlo reinforcement learning algorithm. The algorithms are benchmarked against well-known alternatives within their respective corpus and are shown to outperform in terms of efficiency and optimality. To test the feedback intervention two randomized experiments are performed with 3,256 human participants. The experimental results demonstrate with significance that the rewards learned from "expert" individuals are effective as feedback interventions. In addition to the algorithmic contributions and successful experiments, the paper also describes three reward function modifications to improve reward function feedback interventions for humans.

READ FULL TEXT

page 8

page 23

research
02/16/2021

Inverse Reinforcement Learning in the Continuous Setting with Formal Guarantees

Inverse Reinforcement Learning (IRL) is the problem of finding a reward ...
research
11/26/2021

Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Many practical applications of reinforcement learning require agents to ...
research
01/26/2023

Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons

We provide a theoretical framework for Reinforcement Learning with Human...
research
11/16/2021

Reinforcement Learning with Feedback from Multiple Humans with Diverse Skills

A promising approach to improve the robustness and exploration in Reinfo...
research
08/22/2018

Robust Counterfactual Inferences using Feature Learning and their Applications

In a wide variety of applications, including personalization, we want to...
research
05/25/2018

Visceral Machines: Reinforcement Learning with Intrinsic Rewards that Mimic the Human Nervous System

The human autonomic nervous system has evolved over millions of years an...
research
11/03/2020

Online Observer-Based Inverse Reinforcement Learning

In this paper, a novel approach to the output-feedback inverse reinforce...

Please sign up or login with your details

Forgot password? Click here to reset