Robust Reinforcement Learning via Genetic Curriculum

02/17/2022
by   Yeeho Song, et al.
0

Achieving robust performance is crucial when applying deep reinforcement learning (RL) in safety critical systems. Some of the state of the art approaches try to address the problem with adversarial agents, but these agents often require expert supervision to fine tune and prevent the adversary from becoming too challenging to the trainee agent. While other approaches involve automatically adjusting environment setups during training, they have been limited to simple environments where low-dimensional encodings can be used. Inspired by these approaches, we propose genetic curriculum, an algorithm that automatically identifies scenarios in which the agent currently fails and generates an associated curriculum to help the agent learn to solve the scenarios and acquire more robust behaviors. As a non-parametric optimizer, our approach uses a raw, non-fixed encoding of scenarios, reducing the need for expert supervision and allowing our algorithm to adapt to the changing performance of the agent. Our empirical studies show improvement in robustness over the existing state of the art algorithms, providing training curricula that result in agents being 2 - 8x times less likely to fail without sacrificing cumulative reward. We include an ablation study and share insights on why our algorithm outperforms prior approaches.

READ FULL TEXT
research
05/11/2022

Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II

Traditionally, learning from human demonstrations via direct behavior cl...
research
12/18/2021

Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading

This paper describes how domain knowledge of power system operators can ...
research
07/27/2021

Persistent Reinforcement Learning via Subgoal Curricula

Reinforcement learning (RL) promises to enable autonomous acquisition of...
research
12/29/2022

Backward Curriculum Reinforcement Learning

The current reinforcement learning algorithm uses forward-generated traj...
research
02/23/2021

School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget

Pommerman is a hybrid cooperative/adversarial multi-agent environment, w...
research
05/25/2023

Reward-Machine-Guided, Self-Paced Reinforcement Learning

Self-paced reinforcement learning (RL) aims to improve the data efficien...
research
02/03/2022

Formal Mathematics Statement Curriculum Learning

We explore the use of expert iteration in the context of language modeli...

Please sign up or login with your details

Forgot password? Click here to reset