Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory

06/06/2023
by   Daniel C. H. Tan, et al.
0

Guaranteeing safe behaviour of reinforcement learning (RL) policies poses significant challenges for safety-critical applications, despite RL's generality and scalability. To address this, we propose a new approach to apply verification methods from control theory to learned value functions. By analyzing task structures for safety preservation, we formalize original theorems that establish links between value functions and control barrier functions. Further, we propose novel metrics for verifying value functions in safe control tasks and practical implementation details to improve learning. Our work presents a novel method for certificate learning, which unlocks a diversity of verification techniques from control theory for RL policies, and marks a significant step towards a formal framework for the general, scalable, and verifiable design of RL-based control systems.

READ FULL TEXT
research
10/11/2021

Safe Model-Based Reinforcement Learning Using Robust Control Barrier Functions

Reinforcement Learning (RL) is effective in many scenarios. However, it ...
research
07/04/2022

Safe Reinforcement Learning via Confidence-Based Filters

Ensuring safety is a crucial challenge when deploying reinforcement lear...
research
05/25/2021

Safe Value Functions

The relationship between safety and optimality in control is not well un...
research
12/02/2021

Safe Reinforcement Learning for Grid Voltage Control

Under voltage load shedding has been considered as a standard approach t...
research
04/26/2021

CPS Engineering: Gap Analysis and Perspectives

Virtualization of computing and networking, IT-OT convergence, cybersecu...
research
04/26/2022

Refining Control Barrier Functions through Hamilton-Jacobi Reachability

Safety filters based on Control Barrier Functions (CBFs) have emerged as...
research
09/07/2021

Safe-Critical Modular Deep Reinforcement Learning with Temporal Logic through Gaussian Processes and Control Barrier Functions

Reinforcement learning (RL) is a promising approach and has limited succ...

Please sign up or login with your details

Forgot password? Click here to reset