Certification of Iterative Predictions in Bayesian Neural Networks

05/21/2021
by   Matthew Wicker, et al.
7

We consider the problem of computing reach-avoid probabilities for iterative predictions made with Bayesian neural network (BNN) models. Specifically, we leverage bound propagation techniques and backward recursion to compute lower bounds for the probability that trajectories of the BNN model reach a given set of states while avoiding a set of unsafe states. We use the lower bounds in the context of control and reinforcement learning to provide safety certification for given control policies, as well as to synthesize control policies that improve the certification bounds. On a set of benchmarks, we demonstrate that our framework can be employed to certify policies over BNNs predictions for problems of more than 10 dimensions, and to effectively synthesize policies that significantly increase the lower bound on the satisfaction probability.

READ FULL TEXT

page 4

page 7

page 11

page 12

page 13

page 14

research
03/15/2022

Modern Lower Bound Techniques in Database Theory and Constraint Satisfaction

Conditional lower bounds based on P≠ NP, the Exponential-Time Hypothesis...
research
11/29/2019

Safety Guarantees for Planning Based on Iterative Gaussian Processes

Gaussian Processes (GPs) are widely employed in control and learning bec...
research
02/17/2018

Lower Bounds on Sparse Spanners, Emulators, and Diameter-reducing shortcuts

We prove better lower bounds on additive spanners and emulators, which a...
research
01/13/2023

Constriction for sets of probabilities

Given a set of probability measures 𝒫 representing an agent's knowledge ...
research
06/18/2011

Robust Bayesian reinforcement learning through tight lower bounds

In the Bayesian approach to sequential decision making, exact calculatio...
research
05/10/2021

Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

We introduce a framework for Bayesian experimental design (BED) with imp...
research
03/02/2023

Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

We study the problem of conservative off-policy evaluation (COPE) where ...

Please sign up or login with your details

Forgot password? Click here to reset