MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

09/27/2021
by   Mikayel Samvelyan, et al.
7

The progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that are widely adopted by the community are not explicitly designed for evaluating specific capabilities of RL methods. While there exist environments for assessing particular open problems in RL (such as exploration, transfer learning, unsupervised environment design, or even language-assisted RL), it is generally difficult to extend these to richer, more complex environments once research goes beyond proof-of-concept results. We present MiniHack, a powerful sandbox framework for easily designing novel RL environments. MiniHack is a one-stop shop for RL experiments with environments ranging from small rooms to complex, procedurally generated worlds. By leveraging the full set of entities and environment dynamics from NetHack, one of the richest grid-based video games, MiniHack allows designing custom RL testbeds that are fast and convenient to use. With this sandbox framework, novel environments can be designed easily, either using a human-readable description language or a simple Python interface. In addition to a variety of RL tasks and baselines, MiniHack can wrap existing RL benchmarks and provide ways to seamlessly add additional complexity.

READ FULL TEXT

page 2

page 6

page 19

page 23

page 24

page 25

page 27

page 28

research
07/13/2022

GriddlyJS: A Web IDE for Reinforcement Learning

Progress in reinforcement learning (RL) research is often driven by the ...
research
06/24/2020

The NetHack Learning Environment

Progress in Reinforcement Learning (RL) algorithms goes hand-in-hand wit...
research
12/05/2021

Benchmark for Out-of-Distribution Detection in Deep Reinforcement Learning

Reinforcement Learning (RL) based solutions are being adopted in a varie...
research
11/11/2022

pyRDDLGym: From RDDL to Gym Environments

We present pyRDDLGym, a Python framework for auto-generation of OpenAI G...
research
10/24/2022

Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds

Despite impressive successes, deep reinforcement learning (RL) systems s...
research
06/02/2019

The Principle of Unchanged Optimality in Reinforcement Learning Generalization

Several recent papers have examined generalization in reinforcement lear...
research
07/11/2021

Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results

We study the problem of out-of-distribution dynamics (OODD) detection, w...

Please sign up or login with your details

Forgot password? Click here to reset