Improving Deep Reinforcement Learning in Minecraft with Action Advice

08/02/2019
by   Spencer Frazier, et al.

Training deep reinforcement learning agents to perform complex behaviors in 3D virtual environments requires significant computational resources. This is especially true in environments with high degrees of aliasing, where many states share nearly identical visual features. Minecraft is an exemplar of such an environment. We hypothesize that interactive machine learning (IML), wherein human teachers play a direct role in training through demonstrations, critique, or action advice, may alleviate agent susceptibility to aliasing. However, interactive machine learning is only practical when the number of human interactions is limited, requiring a balance between human teacher effort and agent performance. We conduct experiments with two reinforcement learning algorithms that enable human teachers to give action advice, Feedback Arbitration and Newtonian Action Advice, under visual aliasing conditions. To assess potential cognitive load per advice type, we vary the accuracy and frequency of various human action advice techniques. We examine training efficiency, robustness against infrequent and inaccurate advisor input, and sensitivity to aliasing.
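To make the idea of varying advice accuracy and frequency concrete, the following is a minimal sketch of a simulated advisor. It is not the paper's Feedback Arbitration or Newtonian Action Advice algorithm. The function names, the `frequency` and `accuracy` parameters, and the rule that advice simply overrides the agent's own choice are all assumptions made for illustration.

```python
import random


def simulated_advice(optimal_action, actions, frequency, accuracy, rng):
    """Return an advised action, or None when the advisor stays silent.

    Hypothetical helper: `frequency` is the probability that advice is
    given at a step, `accuracy` the probability that the advice given
    matches `optimal_action`.
    """
    if rng.random() >= frequency:
        return None  # advisor gives no input at this step
    if rng.random() < accuracy:
        return optimal_action  # correct advice
    # Inaccurate advice: pick uniformly among the other actions.
    wrong_actions = [a for a in actions if a != optimal_action]
    return rng.choice(wrong_actions)


def choose_action(agent_action, advice):
    """Assumed arbitration rule: follow advice when present, else the agent."""
    return advice if advice is not None else agent_action


if __name__ == "__main__":
    rng = random.Random(0)
    actions = ["forward", "left", "right", "jump"]
    for step in range(5):
        advice = simulated_advice("forward", actions, 0.5, 0.8, rng)
        print(step, choose_action("left", advice))
```

Lowering `frequency` models a teacher who intervenes less often, and lowering `accuracy` models one whose advice is less reliable, which are the two axes the abstract says are varied.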


research
09/12/2017

Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds

We describe a method to use discrete human feedback to enhance the perfo...
research
04/16/2018

Newtonian Action Advice: Integrating Human Verbal Instruction with Reinforcement Learning

A goal of Interactive Machine Learning (IML) is to enable people without...
research
10/14/2022

Multi-trainer Interactive Reinforcement Learning System

Interactive reinforcement learning can effectively facilitate the agent ...
research
12/16/2021

Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs

The use of deep neural networks as function approximators has led to str...
research
10/08/2021

CheerBots: Chatbots toward Empathy and Emotion using Reinforcement Learning

Apart from the coherence and fluency of responses, an empathetic chatbot...
research
09/21/2019

Leveraging Human Guidance for Deep Reinforcement Learning Tasks

Reinforcement learning agents can learn to solve sequential decision tas...
research
02/19/2022

Teaching Drones on the Fly: Can Emotional Feedback Serve as Learning Signal for Training Artificial Agents?

We investigate whether naturalistic emotional human feedback can be dire...
