safe-control-gym: a Unified Benchmark Suite for Safe Learning-based Control and Reinforcement Learning

09/13/2021
by   Zhaocong Yuan, et al.
63

In recent years, reinforcement learning and learning-based control – as well as the study of their safety, crucial for deployment in real-world robots – have gained significant traction. However, to adequately gauge the progress and applicability of new results, we need the tools to equitably compare the approaches proposed by the controls and reinforcement learning communities. Here, we propose a new open-source benchmark suite, called safe-control-gym. Our starting point is OpenAI's Gym API, which is one of the de facto standard in reinforcement learning research. Yet, we highlight the reasons for its limited appeal to control theory researchers – and safe control, in particular. E.g., the lack of analytical models and constraint specifications. Thus, we propose to extend this API with (i) the ability to specify (and query) symbolic models and constraints and (ii) introduce simulated disturbances in the control inputs, measurements, and inertial properties. We provide implementations for three dynamic systems – the cart-pole, 1D, and 2D quadrotor – and two control tasks – stabilization and trajectory tracking. To demonstrate our proposal – and in an attempt to bring research communities closer together – we show how to use safe-control-gym to quantitatively compare the control performance, data efficiency, and safety of multiple approaches from the areas of traditional control, learning-based control, and reinforcement learning.

READ FULL TEXT
research
08/13/2021

Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

The last half-decade has seen a steep rise in the number of contribution...
research
10/27/2022

Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection

In this study, we leverage the deliberate and systematic fault-injection...
research
08/07/2020

SafePILCO: a software tool for safe and data-efficient policy synthesis

SafePILCO is a software tool for safe and data-efficient policy search w...
research
09/11/2023

The Safety Filter: A Unified View of Safety-Critical Control in Autonomous Systems

Recent years have seen significant progress in the realm of robot autono...
research
01/31/2023

Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees

Robustness and safety are critical for the trustworthy deployment of dee...
research
06/03/2021

Safe RAN control: A Symbolic Reinforcement Learning Approach

In this paper, we present a Symbolic Reinforcement Learning (SRL) based ...
research
01/07/2020

Blue River Controls: A toolkit for Reinforcement Learning Control Systems on Hardware

We provide a simple hardware wrapper around the Quanser's hardware-in-th...

Please sign up or login with your details

Forgot password? Click here to reset