Safe Policy Search with Gaussian Process Models

12/15/2017
by Kyriakos Polymenakos, et al.

We propose a data-efficient method for optimising the parameters of a policy so that it performs a given task safely. Following the PILCO framework, we train a Gaussian process model to capture the system dynamics. The model has useful analytic properties: it allows closed-form computation of error gradients and estimation of the probability of violating given state-space constraints. During both training and operation, only policies deemed safe are implemented on the real system, minimising the risk of failure.
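
To make the safety check concrete, here is a minimal sketch of how a violation probability could be estimated from Gaussian state predictions. It is not the authors' formulation: the function names, the interval constraint on a single state dimension, the independence assumption across time steps, and the `max_risk` threshold are all assumptions made for illustration.

```python
import math


def normal_cdf(x, mean, std):
    """CDF of a univariate Gaussian, via the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def prob_inside(mean, var, lower, upper):
    """Probability that a Gaussian state prediction lies in [lower, upper]."""
    std = math.sqrt(var)
    return normal_cdf(upper, mean, std) - normal_cdf(lower, mean, std)


def violation_probability(means, variances, lower, upper):
    """Probability that the constraint is violated at some time step.

    Assumption (hypothetical simplification): predictions at different
    time steps are treated as independent, so the probability of staying
    safe over the whole trajectory is the product of per-step probabilities.
    """
    p_safe = 1.0
    for m, v in zip(means, variances):
        p_safe *= prob_inside(m, v, lower, upper)
    return 1.0 - p_safe


def is_policy_safe(means, variances, lower, upper, max_risk=0.05):
    """Accept a policy only if its estimated risk is at most max_risk."""
    return violation_probability(means, variances, lower, upper) <= max_risk


if __name__ == "__main__":
    # Hypothetical predicted means and variances of one state dimension
    # over three time steps, as a learned dynamics model might produce.
    means = [0.0, 0.2, 0.4]
    variances = [0.01, 0.02, 0.04]
    risk = violation_probability(means, variances, lower=-1.0, upper=1.0)
    print(f"estimated violation probability: {risk:.4f}")
    print("safe to run on the real system:", is_policy_safe(means, variances, -1.0, 1.0))
```

In this sketch a candidate policy would be run on the real system only when `is_policy_safe` returns `True`; otherwise the optimiser would keep searching for parameters with lower estimated risk.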

research
04/21/2019

Gaussian Process Regression and Classification under Mathematical Constraints with Learning Guarantees

We introduce constrained Gaussian process (CGP), a Gaussian process mode...
research
12/06/2021

Traversing Time with Multi-Resolution Gaussian Process State-Space Models

Gaussian Process state-space models capture complex temporal dependencie...
research
12/10/2018

Closed-form Inference and Prediction in Gaussian Process State-Space Models

We examine an analytic variational inference scheme for the Gaussian Pro...
research
03/04/2018

Process Ordering in a Process Calculus for Spatially-Explicit Ecological Models

In this paper we extend PALPS, a process calculus proposed for the spati...
research
06/04/2019

Uniform Error Bounds for Gaussian Process Regression with Application to Safe Control

Data-driven models are subject to model errors due to limited and noisy ...
research
11/09/2018

Reachability-based safe learning for optimal control problem

In this work we seek for an approach to integrate safety in the learning...
research
05/07/2022

Gaussian Process Self-triggered Policy Search in Weakly Observable Environments

The environments of such large industrial machines as waste cranes in wa...