GUARD: A Safe Reinforcement Learning Benchmark

05/23/2023
by   Weiye Zhao, et al.

Because of its trial-and-error nature, reinforcement learning (RL) is typically challenging to apply to safety-critical real-world applications, such as autonomous driving, human-robot interaction, and robot manipulation, where such errors are not tolerable. Recently, safe RL (i.e., constrained RL), in which agents explore the environment while satisfying constraints, has emerged rapidly in the literature. However, the diversity of algorithms and tasks makes it difficult to compare existing safe RL algorithms. To fill that gap, we introduce GUARD, a Generalized Unified SAfe Reinforcement Learning Development Benchmark. GUARD has several advantages over existing benchmarks. First, GUARD is a generalized benchmark with a wide variety of RL agents, tasks, and safety constraint specifications. Second, GUARD comprehensively covers state-of-the-art safe RL algorithms with self-contained implementations. Third, GUARD is highly customizable in both tasks and algorithms. We present a comparison of state-of-the-art safe RL algorithms in various task settings using GUARD and establish baselines that future work can build on.


research
02/06/2023

State-wise Safe Reinforcement Learning: A Survey

Despite the tremendous success of Reinforcement Learning (RL) algorithms...
research
06/15/2023

Datasets and Benchmarks for Offline Safe Reinforcement Learning

This paper presents a comprehensive benchmarking suite tailored to offli...
research
04/02/2022

Safe Reinforcement Learning via Shielding for POMDPs

Reinforcement learning (RL) in safety-critical environments requires an ...
research
12/02/2020

Safe Reinforcement Learning for Antenna Tilt Optimisation using Shielding and Multiple Baselines

Safe interaction with the environment is one of the most challenging asp...
research
12/12/2022

Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks

Safety comes first in many real-world applications involving autonomous ...
research
05/08/2023

DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety

Deploying reinforcement learning agents in the real world can be challen...
research
07/08/2022

Safe reinforcement learning for multi-energy management systems with known constraint functions

Reinforcement learning (RL) is a promising optimal control technique for...
