SafePILCO: a software tool for safe and data-efficient policy synthesis

08/07/2020
by   Kyriakos Polymenakos, et al.
0

SafePILCO is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the known PILCO algorithm, originally written in MATLAB, to support safe learning. We provide a Python implementation and leverage existing libraries that allow the codebase to remain short and modular, which is appropriate for wider use by the verification, reinforcement learning, and control communities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2021

Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning

Safe reinforcement learning aims to learn a control policy while ensurin...
research
09/13/2021

safe-control-gym: a Unified Benchmark Suite for Safe Learning-based Control and Reinforcement Learning

In recent years, reinforcement learning and learning-based control – as ...
research
08/13/2021

Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

The last half-decade has seen a steep rise in the number of contribution...
research
09/26/2020

Neurosymbolic Reinforcement Learning with Formally Verified Exploration

We present Revel, a partially neural reinforcement learning (RL) framewo...
research
02/08/2016

Data-Efficient Reinforcement Learning in Continuous-State POMDPs

We present a data-efficient reinforcement learning algorithm resistant t...
research
01/19/2019

Towards Physically Safe Reinforcement Learning under Supervision

This paper addresses the question of how a previously available control ...
research
02/26/2022

Safe Exploration for Efficient Policy Evaluation and Comparison

High-quality data plays a central role in ensuring the accuracy of polic...

Please sign up or login with your details

Forgot password? Click here to reset