Towards robust and domain agnostic reinforcement learning competitions

06/07/2021
by   William Hebgen Guss, et al.
19

Reinforcement learning competitions have formed the basis for standard research benchmarks, galvanized advances in the state-of-the-art, and shaped the direction of the field. Despite this, a majority of challenges suffer from the same fundamental problems: participant solutions to the posed challenge are usually domain-specific, biased to maximally exploit compute resources, and not guaranteed to be reproducible. In this paper, we present a new framework of competition design that promotes the development of algorithms that overcome these barriers. We propose four central mechanisms for achieving this end: submission retraining, domain randomization, desemantization through domain obfuscation, and the limitation of competition compute and environment-sample budget. To demonstrate the efficacy of this design, we proposed, organized, and ran the MineRL 2020 Competition on Sample-Efficient Reinforcement Learning. In this work, we describe the organizational outcomes of the competition and show that the resulting participant submissions are reproducible, non-specific to the competition environment, and sample/resource efficient, despite the difficult competition task.

READ FULL TEXT

page 5

page 10

research
03/10/2020

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning

To facilitate research in the direction of sample-efficient reinforcemen...
research
03/10/2020

The MineRL Competition on Sample-Efficient Reinforcement Learning Using Human Priors: A Retrospective

To facilitate research in the direction of sample-efficient reinforcemen...
research
01/26/2021

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

Although deep reinforcement learning has led to breakthroughs in many di...
research
03/29/2021

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

The NeurIPS 2020 Procgen Competition was designed as a centralized bench...
research
04/22/2019

The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors

Though deep reinforcement learning has led to breakthroughs in many diff...
research
03/31/2018

Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning

Synthesizing physiologically-accurate human movement in a variety of con...
research
05/13/2021

Global Wheat Challenge 2020: Analysis of the competition design and winning models

Data competitions have become a popular approach to crowdsource new data...

Please sign up or login with your details

Forgot password? Click here to reset