Approximate Model-Based Shielding for Safe Reinforcement Learning

07/27/2023
by Alexander W. Goodall, et al.

Reinforcement learning (RL) has shown great potential for solving complex tasks in a variety of domains. However, applying RL to safety-critical systems in the real world is difficult, because many algorithms are sample-inefficient and maximising the standard RL objective comes with no guarantees on worst-case performance. In this paper we propose approximate model-based shielding (AMBS), a principled look-ahead shielding algorithm for verifying the performance of learned RL policies with respect to a set of given safety constraints. Our algorithm differs from other shielding approaches in that it does not require prior knowledge of the safety-relevant dynamics of the system. We provide a strong theoretical justification for AMBS and demonstrate superior performance to other safety-aware approaches on a set of Atari games with state-dependent safety labels.
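To make the general idea of look-ahead shielding concrete, the following is a minimal Python sketch. It is not the authors' AMBS implementation: the chain environment, the hand-written `model_step` (standing in for an approximate dynamics model), the policies, the horizon, the rollout count and the threshold are all hypothetical choices made for illustration. The sketch estimates, by Monte Carlo rollouts in the model, the probability that a proposed action leads to an unsafe state within a fixed horizon, and falls back to a backup policy when that estimate exceeds the threshold.

```python
import random

# Hypothetical toy setup (not from the paper): a 1-D chain of integer
# states where any state >= 10 is labelled unsafe.
UNSAFE_FROM = 10


def is_unsafe(state):
    """State-dependent safety label for the toy chain."""
    return state >= UNSAFE_FROM


def model_step(state, action, rng):
    """Stand-in for an approximate dynamics model.

    The action (+1 or -1) is applied with probability 0.9;
    otherwise the state stays where it is.
    """
    return state + action if rng.random() < 0.9 else state


def task_policy(state):
    """Hypothetical reward-seeking policy: always move right."""
    return 1


def backup_policy(state):
    """Hypothetical safe fallback policy: always move left."""
    return -1


def estimate_violation_prob(state, first_action, model_step, policy,
                            is_unsafe, horizon=5, n_rollouts=200, rng=None):
    """Monte Carlo estimate of P(unsafe state within `horizon` steps).

    The first step uses `first_action`; later steps follow `policy`.
    """
    rng = rng if rng is not None else random.Random(0)
    violations = 0
    for _ in range(n_rollouts):
        s = model_step(state, first_action, rng)
        unsafe = is_unsafe(s)
        for _ in range(horizon - 1):
            if unsafe:
                break
            s = model_step(s, policy(s), rng)
            unsafe = is_unsafe(s)
        violations += int(unsafe)
    return violations / n_rollouts


def shielded_action(state, task_policy, backup_policy, model_step,
                    is_unsafe, threshold=0.1, **rollout_kwargs):
    """Return the task action unless its estimated risk exceeds `threshold`."""
    proposed = task_policy(state)
    risk = estimate_violation_prob(state, proposed, model_step, task_policy,
                                   is_unsafe, **rollout_kwargs)
    return proposed if risk <= threshold else backup_policy(state)
```

In this toy setting, `shielded_action(0, task_policy, backup_policy, model_step, is_unsafe)` keeps the task action, because no unsafe state is reachable within five steps, whereas the same call from state 9 is overridden by the backup action. How the risk estimate is computed and bounded in AMBS itself is described in the full paper, not here.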


research
05/13/2022

Provably Safe Reinforcement Learning: A Theoretical and Experimental Comparison

Ensuring safety of reinforcement learning (RL) algorithms is crucial for...
research
06/21/2019

Reinforcement Learning with Convex Constraints

In standard reinforcement learning (RL), a learning agent seeks to optim...
research
04/21/2023

Approximate Shielding of Atari Agents for Safe Exploration

Balancing exploration and conservatism in the constrained setting is an ...
research
03/02/2023

Data-efficient, Explainable and Safe Payload Manipulation: An Illustration of the Advantages of Physical Priors in Model-Predictive Control

Machine Learning methods, such as those from the Reinforcement Learning ...
research
02/07/2023

Adaptive Aggregation for Safety-Critical Control

Safety has been recognized as the central obstacle to preventing the use...
research
07/27/2022

Dynamic Shielding for Reinforcement Learning in Black-Box Environments

It is challenging to use reinforcement learning (RL) in cyber-physical s...
research
12/12/2022

Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks

Safety comes first in many real-world applications involving autonomous ...
