Safe-To-Explore State Spaces: Ensuring Safe Exploration in Policy Search with Hierarchical Task Optimization

10/08/2018
by   Jens Lundell, et al.
0

Policy search reinforcement learning allows robots to acquire skills by themselves. However, the learning procedure is inherently unsafe as the robot has no a-priori way to predict the consequences of the exploratory actions it takes. Therefore, exploration can lead to collisions with the potential to harm the robot and/or the environment. In this work we address the safety aspect by constraining the exploration to happen in safe-to-explore state spaces. These are formed by decomposing target skills (e.g., grasping) into higher ranked sub-tasks (e.g., collision avoidance, joint limit avoidance) and lower ranked movement tasks (e.g., reaching). Sub-tasks are defined as concurrent controllers (policies) in different operational spaces together with associated Jacobians representing their joint-space mapping. Safety is ensured by only learning policies corresponding to lower ranked sub-tasks in the redundant null space of higher ranked ones. As a side benefit, learning in sub-manifolds of the state-space also facilitates sample efficiency. Reaching skills performed in simulation and grasping skills performed on a real robot validate the usefulness of the proposed approach.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

research
03/06/2023

Value Guided Exploration with Sub-optimal Controllers for Learning Dexterous Manipulation

Recently, reinforcement learning has allowed dexterous manipulation skil...
research
02/11/2020

Reaching, Grasping and Re-grasping: Learning Multimode Grasping Skills

The ability to adapt to uncertainties, recover from failures, and coordi...
research
02/11/2020

Reaching, Grasping and Re-grasping: Learning Fine Coordinated Motor Skills

The ability to adapt to uncertainties, recover from failures, and sensor...
research
02/10/2022

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition

Though many reinforcement learning (RL) problems involve learning polici...
research
10/15/2021

Dual-Arm Adversarial Robot Learning

Robot learning is a very promising topic for the future of automation an...
research
10/31/2022

Allowing Safe Contact in Robotic Goal-Reaching: Planning and Tracking in Operational and Null Spaces

In recent years, impressive results have been achieved in robotic manipu...
research
07/24/2023

SafeSteps: Learning Safer Footstep Planning Policies for Legged Robots via Model-Based Priors

We present a footstep planning policy for quadrupedal locomotion that is...

Please sign up or login with your details

Forgot password? Click here to reset