A Narration-based Reward Shaping Approach using Grounded Natural Language Commands

10/31/2019
by Nicholas Waytowich, et al.

While deep reinforcement learning techniques have produced agents that can learn a number of tasks that were previously unlearnable, these techniques remain susceptible to the longstanding problem of reward sparsity. This is especially true for tasks such as training an agent to play StarCraft II, a real-time strategy game in which reward is given only at the end of a game, and games are usually very long. While this problem can be addressed through reward shaping, such approaches typically require a human expert with specialized knowledge. Inspired by the vision of enabling reward shaping through the more accessible paradigm of natural-language narration, we develop a technique that provides the benefits of reward shaping using natural-language commands. Our narration-guided RL agent projects sequences of natural-language commands into the same high-dimensional representation space as the corresponding goal states. We show that our method improves performance compared to traditional reward-shaping approaches. Additionally, we demonstrate that our method generalizes to unseen natural-language commands.
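As a rough illustration of the idea in the abstract, the sketch below adds a shaping bonus to the environment reward whenever the embedding of the current game state lies close to the embedding of a narrated command. This is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the similarity measure, the threshold, or the bonus size, so the use of cosine similarity, the names `shaped_reward`, `threshold`, and `bonus`, and the toy embedding vectors are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def shaped_reward(env_reward, command_embedding, state_embedding,
                  threshold=0.9, bonus=1.0):
    """Add a bonus when the state embedding matches the command embedding.

    Assumption: both embeddings already live in a shared representation
    space. The threshold and bonus values are illustrative, not from the paper.
    """
    similarity = cosine_similarity(command_embedding, state_embedding)
    return env_reward + (bonus if similarity >= threshold else 0.0)


# Toy usage with hand-written vectors standing in for learned embeddings.
command = [1.0, 0.0]        # hypothetical embedding of a narrated command
matching_state = [1.0, 0.0]  # state that satisfies the command
other_state = [0.0, 1.0]     # unrelated state

reward_when_matched = shaped_reward(0.0, command, matching_state)
reward_when_unmatched = shaped_reward(0.0, command, other_state)
```

In this toy setup the sparse end-of-game reward is left untouched, and the bonus only supplies intermediate signal when a narrated goal appears to have been reached.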


research
04/24/2019

Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning

While deep reinforcement learning techniques have led to agents that are...
research
07/21/2020

Soft Expert Reward Learning for Vision-and-Language Navigation

Vision-and-Language Navigation (VLN) requires an agent to find a specifi...
research
04/27/2018

Reward Learning from Narrated Demonstrations

Humans effortlessly "program" one another by communicating goals and des...
research
11/06/2018

An Optimal Itinerary Generation in a Configuration Space of Large Intellectual Agent Groups with Linear Logic

A group of intelligent agents which fulfill a set of tasks in parallel i...
research
11/03/2022

lilGym: Natural Language Visual Reasoning with Reinforcement Learning

We present lilGym, a new benchmark for language-conditioned reinforcemen...
research
09/23/2021

Reinforced Natural Language Interfaces via Entropy Decomposition

In this paper, we study the technical problem of developing conversation...
research
07/26/2017

Guiding Reinforcement Learning Exploration Using Natural Language

In this work we present a technique to use natural language to help rein...
