NAPPO: Modular and scalable reinforcement learning in pytorch

07/06/2020
by   Albert Bou, et al.
0

Reinforcement learning (RL) has been very successful in recent years but, limited by its sample inefficiency, often requires large computational resources. While new methods are being investigated to increase the efficiency of RL algorithms it is critical to enable training at scale, yet using a code-base flexible enough to allow for method experimentation. Here, we present NAPPO, a pytorch-based library for RL which provides scalable proximal policy optimization (PPO) implementations in a simple, modular package. We validate it by replicating previous results on Mujoco and Atari environments. Furthermore, we provide insights on how a variety of distributed training schemes with synchronous and asynchronous communication patterns perform. Finally we showcase NAPPO by obtaining the highest to-date test performance on the Obstacle Tower Unity3D challenge environment. The full source code is available.

READ FULL TEXT

page 4

page 5

research
02/08/2022

skrl: Modular and Flexible Library for Reinforcement Learning

skrl is an open-source modular library for reinforcement learning writte...
research
06/16/2023

Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Open-source reinforcement learning (RL) environments have played a cruci...
research
12/28/2019

SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning

We introduce SLM Lab, a software framework for reproducible reinforcemen...
research
11/30/2019

IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

The practical usage of reinforcement learning agents is often bottleneck...
research
09/27/2019

SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning

We present an overview of SURREAL-System, a reproducible, flexible, and ...
research
12/03/2021

Reinforcement Learning-Based Automatic Berthing System

Previous studies on automatic berthing systems based on artificial neura...
research
08/07/2023

QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration

QDax is an open-source library with a streamlined and modular API for Qu...

Please sign up or login with your details

Forgot password? Click here to reset