Learning to Follow Instructions in Text-Based Games

11/08/2022
by Mathieu Tuli, et al.

Text-based games present a unique class of sequential decision-making problems in which agents interact with a partially observable, simulated environment via actions and observations conveyed through natural language. Such observations typically include instructions that, in a reinforcement learning (RL) setting, can directly or indirectly guide a player towards completing reward-worthy tasks. In this work, we study the ability of RL agents to follow such instructions. We conduct experiments showing that the performance of state-of-the-art text-based game agents is largely unaffected by the presence or absence of such instructions, and that these agents are typically unable to execute tasks to completion. To further study and address the task of instruction following, we equip RL agents with an internal structured representation of natural language instructions in the form of Linear Temporal Logic (LTL), a formal language that is increasingly used for temporally extended reward specification in RL. Our framework both supports and highlights the benefits of understanding the temporal semantics of instructions and of measuring progress towards the achievement of such temporally extended behaviour. Experiments with 500+ games in TextWorld demonstrate the superior performance of our approach.
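To make the idea of measuring progress towards a temporally extended instruction concrete, the following is a minimal sketch of LTL progression, a standard technique for rewriting a formula after each observation so that it states only what remains to be done. It is not the authors' implementation: the tuple encoding of formulas, the function names, and the propositions `fridge_open` and `apple_taken` are all hypothetical and chosen purely for illustration.

```python
# Minimal sketch of LTL progression (illustrative only, not the paper's code).
# Formulas are True, False, or nested tuples:
#   ("prop", name), ("and", a, b), ("or", a, b), ("next", a), ("eventually", a)


def _and(a, b):
    """Conjunction with constant folding."""
    if a is False or b is False:
        return False
    if a is True:
        return b
    if b is True:
        return a
    return ("and", a, b)


def _or(a, b):
    """Disjunction with constant folding."""
    if a is True or b is True:
        return True
    if a is False:
        return b
    if b is False:
        return a
    return a if a == b else ("or", a, b)


def progress(formula, true_props):
    """Rewrite `formula` given the set of propositions true at this step."""
    if isinstance(formula, bool):
        return formula
    op = formula[0]
    if op == "prop":
        return formula[1] in true_props
    if op == "and":
        return _and(progress(formula[1], true_props),
                    progress(formula[2], true_props))
    if op == "or":
        return _or(progress(formula[1], true_props),
                   progress(formula[2], true_props))
    if op == "next":
        # The obligation moves to the following step unchanged.
        return formula[1]
    if op == "eventually":
        # Either it is satisfied now, or it must still be satisfied later.
        return _or(progress(formula[1], true_props), formula)
    raise ValueError(f"unknown operator: {op!r}")


# Hypothetical instruction: "open the fridge, then take the apple".
task = ("eventually",
        ("and",
         ("prop", "fridge_open"),
         ("next", ("eventually", ("prop", "apple_taken")))))

step1 = progress(task, set())              # nothing achieved yet
step2 = progress(step1, {"fridge_open"})   # first sub-goal achieved
step3 = progress(step2, {"apple_taken"})   # instruction completed
```

In this sketch the formula is unchanged while nothing relevant happens, shrinks once the fridge is opened, and reduces to `True` when the apple is taken afterwards. An agent could, under these assumptions, treat each such simplification as a signal of progress and the reduction to `True` as task completion.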

research
02/08/2023

Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning

Designing appropriate reward functions for Reinforcement Learning (RL) a...
research
02/13/2021

LTL2Action: Generalizing LTL Instructions for Multi-Task RL

We address the problem of teaching a deep reinforcement learning (RL) ag...
research
05/19/2020

Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

Recent work has described neural-network-based agents that are trained w...
research
01/25/2020

Following Instructions by Imagining and Reaching Visual Goals

While traditional methods for instruction-following typically assume pri...
research
05/26/2023

A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents

Teaching agents to follow complex written instructions has been an impor...
research
06/07/2021

Playing with words: Do people exploit loaded language to affect others' decisions for their own benefit?

In this article, we study whether people in the position of describing a...
research
05/31/2023

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

Much of the previous work towards digital agents for graphical user inte...
