Survivable Robotic Control through Guided Bayesian Policy Search with Deep Reinforcement Learning

06/29/2021
by   Sayyed Jaffar Ali Raza, et al.
0

Many robot manipulation skills can be represented with deterministic characteristics and there exist efficient techniques for learning parameterized motor plans for those skills. However, one of the active research challenge still remains to sustain manipulation capabilities in situation of a mechanical failure. Ideally, like biological creatures, a robotic agent should be able to reconfigure its control policy by adapting to dynamic adversaries. In this paper, we propose a method that allows an agent to survive in a situation of mechanical loss, and adaptively learn manipulation with compromised degrees of freedom – we call our method Survivable Robotic Learning (SRL). Our key idea is to leverage Bayesian policy gradient by encoding knowledge bias in posterior estimation, which in turn alleviates future policy search explorations, in terms of sample efficiency and when compared to random exploration based policy search methods. SRL represents policy priors as Gaussian process, which allows tractable computation of approximate posterior (when true gradient is intractable), by incorporating guided bias as proxy from prior replays. We evaluate our proposed method against off-the-shelf model free learning algorithm (DDPG), testing on a hexapod robot platform which encounters incremental failure emulation, and our experiments show that our method improves largely in terms of sample requirement and quantitative success ratio in all failure modes. A demonstration video of our experiments can be viewed at: https://sites.google.com/view/survivalrl

READ FULL TEXT

page 1

page 6

research
10/20/2020

Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing

In this paper we present a Bayesian reinforcement learning framework tha...
research
03/19/2018

Composable Deep Reinforcement Learning for Robotic Manipulation

Model-free deep reinforcement learning has been shown to exhibit good pe...
research
09/15/2018

Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes

Guided Policy Search enables robots to learn control policies for comple...
research
05/19/2022

Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks

This paper describes a deep reinforcement learning (DRL) approach that w...
research
03/09/2022

On-Robot Policy Learning with O(2)-Equivariant SAC

Recently, equivariant neural network models have been shown to be useful...
research
07/07/2023

Polybot: Training One Policy Across Robots While Embracing Variability

Reusing large datasets is crucial to scale vision-based robotic manipula...
research
11/13/2015

Active Contextual Entropy Search

Contextual policy search allows adapting robotic movement primitives to ...

Please sign up or login with your details

Forgot password? Click here to reset