A computationally lightweight safe learning algorithm

09/07/2023
by   Dominik Baumann, et al.
0

Safety is an essential asset when learning control policies for physical systems, as violating safety constraints during training can lead to expensive hardware damage. In response to this need, the field of safe learning has emerged with algorithms that can provide probabilistic safety guarantees without knowledge of the underlying system dynamics. Those algorithms often rely on Gaussian process inference. Unfortunately, Gaussian process inference scales cubically with the number of data points, limiting applicability to high-dimensional and embedded systems. In this paper, we propose a safe learning algorithm that provides probabilistic safety guarantees but leverages the Nadaraya-Watson estimator instead of Gaussian processes. For the Nadaraya-Watson estimator, we can reach logarithmic scaling with the number of data points. We provide theoretical guarantees for the estimates, embed them into a safe learning algorithm, and show numerical experiments on a simulated seven-degrees-of-freedom robot manipulator.

READ FULL TEXT
research
01/24/2022

Scalable Safe Exploration for Global Optimization of Dynamical Systems

Learning optimal control policies directly on physical systems is challe...
research
05/23/2017

Safe Model-based Reinforcement Learning with Stability Guarantees

Reinforcement learning is a powerful paradigm for learning optimal polic...
research
09/19/2022

Safety Index Synthesis via Sum-of-Squares Programming

Control systems often need to satisfy strict safety requirements. Safety...
research
10/07/2019

A Learnable Safety Measure

Failures are challenging for learning to control physical systems since ...
research
05/17/2022

Can We Do Better Than Random Start? The Power of Data Outsourcing

Many organizations have access to abundant data but lack the computation...
research
03/28/2022

Safe Active Learning for Multi-Output Gaussian Processes

Multi-output regression problems are commonly encountered in science and...
research
02/02/2021

Symplectic Gaussian Process Dynamics

Dynamics model learning is challenging and at the same time an active fi...

Please sign up or login with your details

Forgot password? Click here to reset