Collaborative Evolutionary Reinforcement Learning

05/02/2019
by   Shauharda Khadka, et al.
0

Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically struggle with achieving effective exploration and are extremely sensitive to the choice of hyperparameters. One reason is that most approaches use a noisy version of their operating policy to explore - thereby limiting the range of exploration. In this paper, we introduce Collaborative Evolutionary Reinforcement Learning (CERL), a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space. A collection of learners - typically proven algorithms like TD3 - optimize over varying time-horizons leading to this diverse portfolio. All learners contribute to and use a shared replay buffer to achieve greater sample efficiency. Computational resources are dynamically distributed to favor the best learners as a form of online algorithm selection. Neuroevolution binds this entire process to generate a single emergent learner that exceeds the capabilities of any individual learner. Experiments in a range of continuous control benchmarks demonstrate that the emergent learner significantly outperforms its composite learners while remaining overall more sample-efficient - notably solving the Mujoco Humanoid benchmark where all of its composite learners (TD3) fail entirely in isolation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/21/2018

Evolutionary Reinforcement Learning

Deep Reinforcement Learning (DRL) algorithms have been successfully appl...
research
12/13/2019

Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning

Reinforcement learning, evolutionary algorithms and imitation learning a...
research
06/20/2023

Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication

Evolutionary Algorithms and Deep Reinforcement Learning have both succes...
research
02/07/2019

Metaoptimization on a Distributed System for Deep Reinforcement Learning

Training intelligent agents through reinforcement learning is a notoriou...
research
09/29/2022

Hierarchical Training of Deep Ensemble Policies for Reinforcement Learning in Continuous Spaces

Many actor-critic deep reinforcement learning (DRL) algorithms have achi...
research
12/02/2018

Efficient Lifelong Learning with A-GEM

In lifelong learning, the learner is presented with a sequence of tasks,...

Please sign up or login with your details

Forgot password? Click here to reset