lilGym: Natural Language Visual Reasoning with Reinforcement Learning

11/03/2022
by Anne Wu, et al.

We present lilGym, a new benchmark for language-conditioned reinforcement learning in visual environments. lilGym is based on 2,661 highly compositional, human-written natural language statements grounded in an interactive visual environment. We annotate every statement with an executable Python program that represents its meaning, which enables exact reward computation in every possible world state. Each statement is paired with multiple start states and reward functions to form thousands of distinct Markov Decision Processes of varying difficulty. We experiment on lilGym with different models and learning regimes. Our results and analysis show that while existing methods achieve non-trivial performance, lilGym remains a challenging open problem. lilGym is available at https://lil.nlp.cornell.edu/lilgym/.
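To make the idea of an executable annotation concrete, here is a minimal sketch of how a statement's meaning could be written as a Python predicate over a world state and then used to compute a reward. It is not lilGym's actual API: the `Item` type, the example statement, the function names `statement_program` and `reward_on_stop`, and the +1/-1 reward values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet


@dataclass(frozen=True)
class Item:
    """One object in the visual world (hypothetical representation)."""
    shape: str
    color: str
    box: int  # index of the box that contains the item


# A world state is modeled here as an immutable set of items (an assumption).
State = FrozenSet[Item]


def statement_program(state: State) -> bool:
    """Hypothetical annotation for a statement such as
    'There is a box with exactly two black items.'"""
    boxes = {item.box for item in state}
    return any(
        sum(1 for item in state if item.box == b and item.color == "black") == 2
        for b in boxes
    )


def reward_on_stop(
    program: Callable[[State], bool], state: State, target: bool
) -> float:
    """Illustrative terminal reward: +1 if the statement's truth value in
    the final state matches the target value, otherwise -1."""
    return 1.0 if program(state) == target else -1.0


def example_state() -> State:
    """Small state in which box 0 holds exactly two black items."""
    return frozenset({
        Item("circle", "black", 0),
        Item("square", "black", 0),
        Item("triangle", "yellow", 1),
    })
```

Because the program can be evaluated on any state, the same statement can be paired with different start states and target truth values, and each pairing defines a separate decision process with an exactly computable reward.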

research
10/02/2017

Visual Reasoning with Natural Language

Natural language provides a widely accessible and expressive interface f...
research
07/19/2020

An Overview of Natural Language State Representation for Reinforcement Learning

A suitable state representation is a fundamental part of the learning pr...
research
04/14/2017

Environment-Independent Task Specifications via GLTL

We propose a new task-specification language for Markov decision process...
research
10/31/2019

A Narration-based Reward Shaping Approach using Grounded Natural Language Commands

While deep reinforcement learning techniques have led to agents that are...
research
04/24/2019

Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning

While deep reinforcement learning techniques have led to agents that are...
research
02/16/2022

Open-Ended Reinforcement Learning with Neural Reward Functions

Inspired by the great success of unsupervised learning in Computer Visio...
research
07/15/2021

A Reinforcement Learning Environment for Mathematical Reasoning via Program Synthesis

We convert the DeepMind Mathematics Dataset into a reinforcement learnin...
