Sample Efficient Ensemble Learning with Catalyst.RL

03/29/2020
by   Sergey Kolesnikov, et al.
21

We present Catalyst.RL, an open-source PyTorch framework for reproducible and sample efficient reinforcement learning (RL) research. Main features of Catalyst.RL include large-scale asynchronous distributed training, efficient implementations of various RL algorithms and auxiliary tricks, such as n-step returns, value distributions, hyperbolic reinforcement learning, etc. To demonstrate the effectiveness of Catalyst.RL, we applied it to a physics-based reinforcement learning challenge "NeurIPS 2019: Learn to Move - Walk Around" with the objective to build a locomotion controller for a human musculoskeletal model. The environment is computationally expensive, has a high-dimensional continuous action space and is stochastic. Our team took the 2nd place, capitalizing on the ability of Catalyst.RL to train high-quality and sample-efficient RL agents in only a few hours of training time. The implementation along with experiments is open-sourced so results can be reproduced and novel ideas tried out.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2019

Catalyst.RL: A Distributed Framework for Reproducible RL Research

Despite the recent progress in deep reinforcement learning field (RL), a...
research
11/30/2019

IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

The practical usage of reinforcement learning agents is often bottleneck...
research
09/13/2023

Efficient Reinforcement Learning for Jumping Monopods

In this work, we consider the complex control problem of making a monopo...
research
06/29/2023

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

The ever-growing complexity of reinforcement learning (RL) tasks demands...
research
11/19/2021

Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari

Across the Arcade Learning Environment, Rainbow achieves a level of perf...
research
04/20/2022

A Reinforcement Learning-based Volt-VAR Control Dataset and Testing Environment

To facilitate the development of reinforcement learning (RL) based power...
research
05/19/2020

Privileged Information Dropout in Reinforcement Learning

Using privileged information during training can improve the sample effi...

Please sign up or login with your details

Forgot password? Click here to reset