BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization

08/01/2023
by   Junyi Wang, et al.
0

Evolutionary reinforcement learning (ERL) algorithms recently raise attention in tackling complex reinforcement learning (RL) problems due to high parallelism, while they are prone to insufficient exploration or model collapse without carefully tuning hyperparameters (aka meta-parameters). In the paper, we propose a general meta ERL framework via bilevel optimization (BiERL) to jointly update hyperparameters in parallel to training the ERL model within a single agent, which relieves the need for prior domain knowledge or costly optimization procedure before model deployment. We design an elegant meta-level architecture that embeds the inner-level's evolving experience into an informative population representation and introduce a simple and feasible evaluation of the meta-level fitness function to facilitate learning efficiency. We perform extensive experiments in MuJoCo and Box2D tasks to verify that as a general framework, BiERL outperforms various baselines and consistently improves the learning performance for a diversity of ERL algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2020

Sample-Efficient Automated Deep Reinforcement Learning

Despite significant progress in challenging problems across various doma...
research
12/13/2019

Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning

Reinforcement learning, evolutionary algorithms and imitation learning a...
research
09/21/2022

Learning from Symmetry: Meta-Reinforcement Learning with Symmetric Data and Language Instructions

Meta-reinforcement learning (meta-RL) is a promising approach that enabl...
research
03/03/2020

Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning

Despite recent success of deep network-based Reinforcement Learning (RL)...
research
02/06/2020

One-Shot Bayes Opt with Probabilistic Population Based Training

Selecting optimal hyperparameters is a key challenge in machine learning...
research
08/02/2023

Wasserstein Diversity-Enriched Regularizer for Hierarchical Reinforcement Learning

Hierarchical reinforcement learning composites subpolicies in different ...
research
05/21/2021

On the use of feature-maps and parameter control for improved quality-diversity meta-evolution

In Quality-Diversity (QD) algorithms, which evolve a behaviourally diver...

Please sign up or login with your details

Forgot password? Click here to reset