Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning

01/19/2021
by   H. J. Austin Wang, et al.
0

In this paper, we consider the problem of leveraging textual descriptions to improve generalization of control policies to new scenarios. Unlike prior work in this space, we do not assume access to any form of prior knowledge connecting text and state observations, and learn both symbol grounding and control policy simultaneously. This is challenging due to a lack of concrete supervision, and incorrect groundings can result in worse performance than policies that do not use the text at all. We develop a new model, EMMA (Entity Mapper with Multi-modal Attention) which uses a multi-modal entity-conditioned attention module that allows for selective focus over relevant sentences in the manual for each entity in the environment. EMMA is end-to-end differentiable and can learn a latent grounding of entities and dynamics from text to observations using environment rewards as the only source of supervision. To empirically test our model, we design a new framework of 1320 games and collect text manuals with free-form natural language via crowd-sourcing. We demonstrate that EMMA achieves successful zero-shot generalization to unseen games with new dynamics, obtaining significantly higher rewards compared to multiple baselines. The grounding acquired by EMMA is also robust to noisy descriptions and linguistic variation.

READ FULL TEXT
research
10/25/2022

Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning

We investigate the use of natural language to drive the generalization o...
research
04/23/2018

Attention Based Natural Language Grounding by Navigating Virtual Environment

In this work, we focus on the problem of grounding language by training ...
research
11/26/2022

Who are you referring to? Weakly supervised coreference resolution with multimodal grounding

Coreference resolution aims at identifying words and phrases which refer...
research
10/14/2019

Dynamic Attention Networks for Task Oriented Grounding

In order to successfully perform tasks specified by natural language ins...
research
08/01/2017

Deep Transfer in Reinforcement Learning by Language Grounding

In this paper, we explore the utilization of natural language to drive t...
research
08/19/2019

Transfer in Deep Reinforcement Learning using Knowledge Graphs

Text adventure games, in which players must make sense of the world thro...
research
04/06/2020

Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics

Reinforcement learning algorithms such as Q-learning have shown great pr...

Please sign up or login with your details

Forgot password? Click here to reset