Empirical Design in Reinforcement Learning

04/03/2023
by   Andrew Patterson, et al.
0

Empirical design in reinforcement learning is no small task. Running good experiments requires attention to detail and at times significant computational resources. While compute resources available per dollar have continued to grow rapidly, so have the scale of typical experiments in reinforcement learning. It is now common to benchmark agents with millions of parameters against dozens of tasks, each using the equivalent of 30 days of experience. The scale of these experiments often conflict with the need for proper statistical evidence, especially when comparing algorithms. Recent studies have highlighted how popular algorithms are sensitive to hyper-parameter settings and implementation details, and that common empirical practice leads to weak statistical evidence (Machado et al., 2018; Henderson et al., 2018). Here we take this one step further. This manuscript represents both a call to action, and a comprehensive resource for how to do good experiments in reinforcement learning. In particular, we cover: the statistical assumptions underlying common performance measures, how to properly characterize performance variation and stability, hypothesis testing, special considerations for comparing multiple agents, baseline and illustrative example construction, and how to deal with hyper-parameters and experimenter bias. Throughout we highlight common mistakes found in the literature and the statistical consequences of those in example experiments. The objective of this document is to provide answers on how we can use our unprecedented compute to do good science in reinforcement learning, as well as stay alert to potential pitfalls in our empirical design.

READ FULL TEXT
research
01/28/2023

Towards Learning Rubik's Cube with N-tuple-based Reinforcement Learning

This work describes in detail how to learn and solve the Rubik's cube ga...
research
05/22/2017

AIXIjs: A Software Demo for General Reinforcement Learning

Reinforcement learning is a general and powerful framework with which to...
research
12/02/2022

Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning

Legged systems have many advantages when compared to their wheeled count...
research
06/02/2022

Equivariant Reinforcement Learning for Quadrotor UAV

This paper presents an equivariant reinforcement learning framework for ...
research
11/20/2020

Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research

Since the introduction of DQN, a vast majority of reinforcement learning...
research
09/07/2021

On the impact of MDP design for Reinforcement Learning agents in Resource Management

The recent progress in Reinforcement Learning applications to Resource M...
research
07/20/2023

Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment

We leverage the fast physics simulator, MuJoCo to run tasks in a continu...

Please sign up or login with your details

Forgot password? Click here to reset