Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills

10/26/2020
by Samuele Tosatto, et al.

Parameterized movement primitives have been used extensively for imitation learning of robotic tasks. However, the high dimensionality of the parameter space hinders the improvement of such primitives in the reinforcement learning (RL) setting, especially when learning with physical robots. In this paper, we propose a novel view on handling the demonstrated trajectories to acquire low-dimensional, non-linear latent dynamics, using mixtures of probabilistic principal component analyzers (MPPCA) on the movements' parameter space. Moreover, we introduce a new contextual off-policy RL algorithm, named LAtent-Movements Policy Optimization (LAMPO). LAMPO provides gradient estimates from previous experience using self-normalized importance sampling, and hence makes full use of samples collected in previous learning iterations. Combined, these advantages provide a complete framework for sample-efficient off-policy optimization of movement primitives for robot learning of high-dimensional manipulation skills. Our experimental results, obtained both in simulation and on a real robot, show that LAMPO yields more sample-efficient policies than common approaches in the literature.
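To make the idea of reusing past samples concrete, the following is a minimal, generic sketch of self-normalized importance sampling on a one-dimensional toy problem. It is not the LAMPO algorithm itself: the Gaussian behavior and target distributions, the quadratic return, and all function and variable names are assumptions chosen only for illustration.

```python
import numpy as np


def gaussian_logpdf(x, mean, std):
    """Log-density of a univariate Gaussian, evaluated elementwise."""
    return -0.5 * ((x - mean) / std) ** 2 - np.log(std) - 0.5 * np.log(2.0 * np.pi)


def snis_estimate(returns, logp_target, logp_behavior):
    """Self-normalized importance sampling estimate of the expected return.

    The samples were drawn from the behavior distribution; the weights are
    normalized to sum to one, which makes the estimate invariant to any
    constant factor in either density.
    """
    log_w = logp_target - logp_behavior
    log_w = log_w - np.max(log_w)  # shift for numerical stability
    w = np.exp(log_w)
    w = w / np.sum(w)
    return float(np.sum(w * returns))


# Hypothetical toy setup: parameters sampled under an old (behavior) search
# distribution are reused to evaluate a new (target) distribution.
rng = np.random.default_rng(0)
behavior_mean, behavior_std = 0.0, 1.0
target_mean, target_std = 0.5, 1.0

theta = rng.normal(behavior_mean, behavior_std, size=5000)
returns = -(theta - 1.0) ** 2  # assumed toy return, maximal at theta = 1

estimate = snis_estimate(
    returns,
    gaussian_logpdf(theta, target_mean, target_std),
    gaussian_logpdf(theta, behavior_mean, behavior_std),
)
print(f"SNIS estimate of the expected return under the target: {estimate:.3f}")
```

For this toy setup the exact expected return under the target distribution is -(1 + 0.25) = -1.25, so the printed estimate should land close to that value without drawing any new samples from the target.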


