Safe Reinforcement Learning via Online Shielding

05/25/2019
by   Osbert Bastani, et al.
0

Reinforcement learning is a promising approach to learning control policies for complex robotics tasks. A key challenge is ensuring safety of the learned control policy---e.g., that a walking robot does not fall over, or a quadcopter does not run into a wall. We focus on the setting where the dynamics are known, and the goal is to prove that a policy learned in simulation satisfies a given safety constraint. Existing approaches for ensuring safety suffer from a number of limitations---e.g., they do not scale to high-dimensional state spaces, or they only ensure safety for a fixed environment. We propose an approach based on shielding, which uses a backup controller to override the learned controller as necessary to ensure that safety holds. Rather than compute when to use the backup controller ahead-of-time, we perform this computation online. By doing so, we ensure that our approach is computationally efficient, and furthermore, can be used to ensure safety even in novel environments. We empirically demonstrate that our approach can ensure safety in experiments on cart-pole and on a bicycle with random obstacles.

READ FULL TEXT
research
10/24/2019

Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics

This paper proposes a framework for safe reinforcement learning that can...
research
10/11/2021

Safe Human-Interactive Control via Shielding

Ensuring safety for human-interactive robotics is important due to the p...
research
08/17/2020

Runtime-Safety-Guided Policy Repair

We study the problem of policy repair for learning-based control policie...
research
09/21/2023

Learning to Recover for Safe Reinforcement Learning

Safety controllers is widely used to achieve safe reinforcement learning...
research
12/17/2020

Online Shielding for Stochastic Systems

In this paper, we propose a method to develop trustworthy reinforcement ...
research
10/25/2021

Safely Bridging Offline and Online Reinforcement Learning

A key challenge to deploying reinforcement learning in practice is explo...
research
04/19/2023

Model Based Reinforcement Learning for Personalized Heparin Dosing

A key challenge in sequential decision making is optimizing systems safe...

Please sign up or login with your details

Forgot password? Click here to reset