DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable Feedback

10/28/2018
by   Riku Arakawa, et al.
0

Exploration has been one of the greatest challenges in reinforcement learning (RL), which is a large obstacle in the application of RL to robotics. Even with state-of-the-art RL algorithms, building a well-learned agent often requires too many trials, mainly due to the difficulty of matching its actions with rewards in the distant future. A remedy for this is to train an agent with real-time feedback from a human observer who immediately gives rewards for some actions. This study tackles a series of challenges for introducing such a human-in-the-loop RL scheme. The first contribution of this work is our experiments with a precisely modeled human observer: binary, delay, stochasticity, unsustainability, and natural reaction. We also propose an RL method called DQN-TAMER, which efficiently uses both human feedback and distant rewards. We find that DQN-TAMER agents outperform their baselines in Maze and Taxi simulated environments. Furthermore, we demonstrate a real-world human-in-the-loop RL application where a camera automatically recognizes a user's facial expressions as feedback to the agent while the agent explores a maze.

READ FULL TEXT

page 1

page 6

research
06/30/2020

Accelerating Reinforcement Learning Agent with EEG-based Implicit Human Feedback

Providing Reinforcement Learning (RL) agents with human feedback can dra...
research
10/07/2022

Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop

Human-in-the-loop (HiL) reinforcement learning is gaining traction in do...
research
07/17/2020

Explanation Augmented Feedback in Human-in-the-Loop Reinforcement Learning

Human-in-the-loop Reinforcement Learning (HRL) aims to integrate human g...
research
06/27/2022

Humans are not Boltzmann Distributions: Challenges and Opportunities for Modelling Human Feedback and Interaction in Reinforcement Learning

Reinforcement learning (RL) commonly assumes access to well-specified re...
research
12/02/2021

Towards Intrinsic Interactive Reinforcement Learning

Reinforcement learning (RL) and brain-computer interfaces (BCI) are two ...
research
08/30/2022

Distributed Ensembles of Reinforcement Learning Agents for Electricity Control

Deep Reinforcement Learning (or just "RL") is gaining popularity for ind...
research
10/01/2019

Accelerated Robot Learning via Human Brain Signals

In reinforcement learning (RL), sparse rewards are a natural way to spec...

Please sign up or login with your details

Forgot password? Click here to reset