Fast Population-Based Reinforcement Learning on a Single Machine

06/17/2022
by Arthur Flajolet, et al.

Training populations of agents has demonstrated great promise in Reinforcement Learning for stabilizing training, improving exploration and asymptotic performance, and generating a diverse set of solutions. However, population-based training is often not considered by practitioners as it is perceived to be either prohibitively slow (when implemented sequentially), or computationally expensive (if agents are trained in parallel on independent accelerators). In this work, we compare implementations and revisit previous studies to show that the judicious use of compilation and vectorization allows population-based training to be performed on a single machine with one accelerator with minimal overhead compared to training a single agent. We also show that, when provided with a few accelerators, our protocols extend to large population sizes for applications such as hyperparameter tuning. We hope that this work and the public release of our code will encourage practitioners to use population-based learning more frequently for their research and applications.
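
To make the compilation-and-vectorization idea concrete, here is a minimal sketch in JAX. It is not the authors' released code: the tiny linear "agent", the shared batch, the per-agent learning rates, and every name (`init_params`, `loss_fn`, `update`, `population_update`) are illustrative assumptions. It only shows the general pattern of writing the update for one agent, vectorizing it over a population axis with `jax.vmap`, and compiling the result once with `jax.jit`.

```python
import jax
import jax.numpy as jnp

# Illustrative sizes only; none of these values come from the paper.
POP_SIZE, OBS_DIM, BATCH = 8, 4, 32

def init_params(key):
    # A tiny linear "agent", standing in for a real policy or critic network.
    return {"w": 0.1 * jax.random.normal(key, (OBS_DIM,)), "b": jnp.zeros(())}

def loss_fn(params, obs, target):
    pred = obs @ params["w"] + params["b"]
    return jnp.mean((pred - target) ** 2)

def update(params, obs, target, lr):
    # One SGD step for a single agent; returns the loss before the step.
    loss, grads = jax.value_and_grad(loss_fn)(params, obs, target)
    new_params = jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)
    return new_params, loss

# Vectorize over the leading (population) axis of params and lr, share the
# data batch across agents, then compile the whole population update once.
population_update = jax.jit(jax.vmap(update, in_axes=(0, None, None, 0)))

key = jax.random.PRNGKey(0)
k_init, k_obs = jax.random.split(key)
params = jax.vmap(init_params)(jax.random.split(k_init, POP_SIZE))
obs = jax.random.normal(k_obs, (BATCH, OBS_DIM))
target = obs.sum(axis=1)                 # synthetic regression target
lrs = jnp.linspace(0.01, 0.1, POP_SIZE)  # one learning rate per agent

# Losses at initialization (the updated params are discarded here).
_, first_losses = population_update(params, obs, target, lrs)
for _ in range(50):
    params, losses = population_update(params, obs, target, lrs)
```

Every leaf of `params` carries a leading axis of size `POP_SIZE`, so a single compiled call updates all agents at once instead of looping over them in Python. Giving each agent its own learning rate is one simple way a population can differ, in the spirit of the hyperparameter-tuning use case mentioned in the abstract.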


