BabyAI++: Towards Grounded-Language Learning beyond Memorization

04/15/2020
by   Tianshi Cao, et al.
0

Despite success in many real-world tasks (e.g., robotics), reinforcement learning (RL) agents still learn from tabula rasa when facing new and dynamic scenarios. By contrast, humans can offload this burden through textual descriptions. Although recent works have shown the benefits of instructive texts in goal-conditioned RL, few have studied whether descriptive texts help agents to generalize across dynamic environments. To promote research in this direction, we introduce a new platform, BabyAI++, to generate various dynamic environments along with corresponding descriptive texts. Moreover, we benchmark several baselines inherited from the instruction following setting and develop a novel approach towards visually-grounded language learning on our platform. Extensive experiments show strong evidence that using descriptive texts improves the generalization of RL agents across environments with varied dynamics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2021

Improving Reinforcement Learning with Human Assistance: An Argument for Human Subject Studies with HIPPO Gym

Reinforcement learning (RL) is a popular machine learning paradigm for g...
research
06/12/2020

Language-Conditioned Goal Generation: a New Approach to Language Grounding for RL

In the real world, linguistic agents are also embodied agents: they perc...
research
06/14/2023

Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning

Whereas machine learning models typically learn language by directly tra...
research
10/27/2021

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Deep Reinforcement Learning (RL) is successful in solving many complex M...
research
12/06/2019

VALAN: Vision and Language Agent Navigation

VALAN is a lightweight and scalable software framework for deep reinforc...
research
01/01/2021

When Is Generalizable Reinforcement Learning Tractable?

Agents trained by reinforcement learning (RL) often fail to generalize b...
research
10/18/2019

RTFM: Generalising to Novel Environment Dynamics via Reading

Obtaining policies that can generalise to new environments in reinforcem...

Please sign up or login with your details

Forgot password? Click here to reset