Powderworld: A Platform for Understanding Generalization via Rich Task Distributions

11/23/2022
by   Kevin Frans, et al.
0

One of the grand challenges of reinforcement learning is the ability to generalize to new tasks. However, general agents require a set of rich, diverse tasks to train on. Designing a `foundation environment' for such tasks is tricky – the ideal environment would support a range of emergent phenomena, an expressive task space, and fast runtime. To take a step towards addressing this research bottleneck, this work presents Powderworld, a lightweight yet expressive simulation environment running directly on the GPU. Within Powderworld, two motivating challenges distributions are presented, one for world-modelling and one for reinforcement learning. Each contains hand-designed test tasks to examine generalization. Experiments indicate that increasing the environment's complexity improves generalization for world models and certain reinforcement learning agents, yet may inhibit learning in high-variance environments. Powderworld aims to support the study of generalization by providing a source of diverse tasks arising from the same core rules.

READ FULL TEXT

page 2

page 5

page 9

page 16

page 17

page 19

page 20

page 21

research
10/20/2019

Autonomous Industrial Management via Reinforcement Learning: Self-Learning Agents for Decision-Making – A Review

Industry has always been in the pursuit of becoming more economically ef...
research
08/31/2018

APES: a Python toolbox for simulating reinforcement learning environments

Assisted by neural networks, reinforcement learning agents have been abl...
research
04/26/2020

Reinforcement Learning Generalization with Surprise Minimization

Generalization remains a challenging problem for reinforcement learning ...
research
12/03/2019

Leveraging Procedural Generation to Benchmark Reinforcement Learning

In this report, we introduce Procgen Benchmark, a suite of 16 procedural...
research
01/07/2018

Building Generalizable Agents with a Realistic and Rich 3D Environment

Towards bridging the gap between machine and human intelligence, it is o...
research
12/13/2022

Improving generalization in reinforcement learning through forked agents

An eco-system of agents each having their own policy with some, but limi...
research
10/16/2020

The Deep Bootstrap: Good Online Learners are Good Offline Generalizers

We propose a new framework for reasoning about generalization in deep le...

Please sign up or login with your details

Forgot password? Click here to reset