Contextual Reinforcement Learning of Visuo-tactile Multi-fingered Grasping Policies

11/21/2019
by   Visak Kumar, et al.
0

Using simulation to train robot manipulation policies holds the promise of an almost unlimited amount of training data, generated safely out of harm's way. One of the key challenges of using simulation, to date, has been to bridge the reality gap, so that policies trained in simulation can be deployed in the real world. We explore the reality gap in the context of learning a contextual policy for multi-fingered robotic grasping. We propose a Grasping Objects Approach for Tactile (GOAT) robotic hands, learning to overcome the reality gap problem. In our approach we use human hand motion demonstration to initialize and reduce the search space for learning. We contextualize our policy with the bounding cuboid dimensions of the object of interest, which allows the policy to work on a more flexible representation than directly using an image or point cloud. Leveraging fingertip touch sensors in the hand allows the policy to overcome the reduction in geometric information introduced by the coarse bounding box, as well as pose estimation uncertainty. We show our learned policy successfully runs on a real robot without any fine tuning, thus bridging the reality gap.

READ FULL TEXT

page 2

page 6

research
09/27/2018

Deep Object Pose Estimation for Semantic Robotic Grasping of Household Objects

Using synthetic data for training deep neural networks for robotic manip...
research
05/08/2020

A Monte Carlo Approach to Closing the Reality Gap

We propose a novel approach to the 'reality gap' problem, i.e., modifyin...
research
10/24/2022

Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation

Dexterous robotic hands have the capability to interact with a wide vari...
research
05/17/2023

Crossing the Reality Gap in Tactile-Based Learning

Tactile sensors are believed to be essential in robotic manipulation, an...
research
08/06/2021

OHPL: One-shot Hand-eye Policy Learner

The control of a robot for manipulation tasks generally relies on object...
research
07/27/2018

Adapting control policies from simulation to reality using a pairwise loss

This paper proposes an approach to domain transfer based on a pairwise l...
research
03/08/2019

Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes

Recent advances in on-policy reinforcement learning (RL) methods enabled...

Please sign up or login with your details

Forgot password? Click here to reset