Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning

04/20/2021
by   Zikang Xiong, et al.
0

There has been significant recent interest in devising verification techniques for learning-enabled controllers (LECs) that manage safety-critical systems. Given the opacity and lack of interpretability of the neural policies that govern the behavior of such controllers, many existing approaches enforce safety properties through the use of shields, a dynamic monitoring and repair mechanism that ensures a LEC does not emit actions that would violate desired safety conditions. These methods, however, have shown to have significant scalability limitations because verification costs grow as problem dimensionality and objective complexity increase. In this paper, we propose a new automated verification pipeline capable of synthesizing high-quality safety shields even when the problem domain involves hundreds of dimensions, or when the desired objective involves stochastic perturbations, liveness considerations, and other complex non-functional properties. Our key insight involves separating safety verification from neural controller, using pre-computed verified safety shields to constrain neural controller training which does not only focus on safety. Experimental results over a range of realistic high-dimensional deep RL benchmarks demonstrate the effectiveness of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2020

Runtime Safety Assurance Using Reinforcement Learning

The airworthiness and safety of a non-pedigreed autopilot must be verifi...
research
10/21/2020

Safety Verification of Model Based Reinforcement Learning Controllers

Model-based reinforcement learning (RL) has emerged as a promising tool ...
research
03/11/2022

Deep Binary Reinforcement Learning for Scalable Verification

The use of neural networks as function approximators has enabled many ad...
research
01/28/2022

Joint Differentiable Optimization and Verification for Certified Reinforcement Learning

In model-based reinforcement learning for safety-critical control system...
research
03/12/2019

Blackbox End-to-End Verification of Ground Robot Safety and Liveness

We formally prove end-to-end correctness of a ground robot implemented i...
research
06/05/2023

Conformal Predictive Safety Filter for RL Controllers in Dynamic Environments

The interest in using reinforcement learning (RL) controllers in safety-...
research
07/16/2019

An Inductive Synthesis Framework for Verifiable Reinforcement Learning

Despite the tremendous advances that have been made in the last decade o...

Please sign up or login with your details

Forgot password? Click here to reset