Information-Theoretic Safe Exploration with Gaussian Processes

12/09/2022
by   Alessandro G. Bottero, et al.
0

We consider a sequential decision making task where we are not allowed to evaluate parameters that violate an a priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown constraint and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and cannot be directly extended to the continuous case. Moreover, the way in which they exploit regularity assumptions about the constraint introduces an additional critical hyperparameter. In this paper, we propose an information-theoretic safe exploration criterion that directly exploits the GP posterior to identify the most informative safe parameters to evaluate. Our approach is naturally applicable to continuous domains and does not require additional hyperparameters. We theoretically analyze the method and show that we do not violate the safety constraint with high probability and that we explore by learning about the constraint up to arbitrary precision. Empirical evaluations demonstrate improved data-efficiency and scalability.

READ FULL TEXT
research
11/10/2022

Adaptive Real Time Exploration and Optimization for Safety-Critical Systems

We consider the problem of decision-making under uncertainty in an envir...
research
06/15/2016

Safe Exploration in Finite Markov Decision Processes with Gaussian Processes

In classical reinforcement learning, when exploring an environment, agen...
research
12/08/2021

Gaussian Process Constraint Learning for Scalable Chance-Constrained Motion Planning from Demonstrations

We propose a method for learning constraints represented as Gaussian pro...
research
05/05/2020

Regret Bounds for Safe Gaussian Process Bandit Optimization

Many applications require a learner to make sequential decisions given u...
research
06/13/2019

Robust Regression for Safe Exploration in Control

We study the problem of safe learning and exploration in sequential cont...
research
11/03/2022

Benefits of Monotonicity in Safe Exploration with Gaussian Processes

We consider the problem of sequentially maximising an unknown function o...
research
04/22/2022

SCOPE: Safe Exploration for Dynamic Computer Systems Optimization

Modern computer systems need to execute under strict safety constraints ...

Please sign up or login with your details

Forgot password? Click here to reset