Probabilistically Safe Policy Transfer

05/15/2017
by   David Held, et al.
0

Although learning-based methods have great potential for robotics, one concern is that a robot that updates its parameters might cause large amounts of damage before it learns the optimal policy. We formalize the idea of safe learning in a probabilistic sense by defining an optimization problem: we desire to maximize the expected return while keeping the expected damage below a given safety limit. We study this optimization for the case of a robot manipulator with safety-based torque limits. We would like to ensure that the damage constraint is maintained at every step of the optimization and not just at convergence. To achieve this aim, we introduce a novel method which predicts how modifying the torque limit, as well as how updating the policy parameters, might affect the robot's safety. We show through a number of experiments that our approach allows the robot to improve its performance while ensuring that the expected damage constraint is not violated during the learning process.

READ FULL TEXT

page 1

page 6

research
01/27/2022

SafeAPT: Safe Simulation-to-Real Robot Learning using Diverse Policies Learned in Simulation

The framework of Simulation-to-real learning, i.e, learning policies in ...
research
05/27/2021

GoSafe: Globally Optimal Safe Robot Learning

When learning policies for robotic systems from data, safety is a major ...
research
03/16/2021

Lyapunov Barrier Policy Optimization

Deploying Reinforcement Learning (RL) agents in the real-world require t...
research
10/11/2021

Safe Human-Interactive Control via Shielding

Ensuring safety for human-interactive robotics is important due to the p...
research
10/16/2020

Uncertainty-aware Contact-safe Model-based Reinforcement Learning

This paper presents contact-safe Model-based Reinforcement Learning (MBR...
research
03/09/2022

Dimensionality Reduction and Prioritized Exploration for Policy Search

Black-box policy optimization is a class of reinforcement learning algor...
research
10/02/2019

Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots

Robotics has proved to be an indispensable tool in many industrial as we...

Please sign up or login with your details

Forgot password? Click here to reset