Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning

08/01/2022
by   Andrej Orsula, et al.
0

Extraterrestrial rovers with a general-purpose robotic arm have many potential applications in lunar and planetary exploration. Introducing autonomy into such systems is desirable for increasing the time that rovers can spend gathering scientific data and collecting samples. This work investigates the applicability of deep reinforcement learning for vision-based robotic grasping of objects on the Moon. A novel simulation environment with procedurally-generated datasets is created to train agents under challenging conditions in unstructured scenes with uneven terrain and harsh illumination. A model-free off-policy actor-critic algorithm is then employed for end-to-end learning of a policy that directly maps compact octree observations to continuous actions in Cartesian space. Experimental evaluation indicates that 3D data representations enable more effective learning of manipulation skills when compared to traditionally used image-based observations. Domain randomization improves the generalization of learned policies to novel scenes with previously unseen objects and different illumination conditions. To this end, we demonstrate zero-shot sim-to-real transfer by evaluating trained agents on a real robot in a Moon-analogue facility.

READ FULL TEXT

page 1

page 4

page 7

research
02/28/2018

Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods

In this paper, we explore deep reinforcement learning algorithms for vis...
research
09/21/2021

Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning

We introduce a novel method to teach a robotic agent to interactively ex...
research
03/04/2022

Learning Goal-Oriented Non-Prehensile Pushing in Cluttered Scenes

Pushing objects through cluttered scenes is a challenging task, especial...
research
05/06/2023

Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation

Manipulating objects without grasping them is an essential component of ...
research
10/02/2020

Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds

6D robotic grasping beyond top-down bin-picking scenarios is a challengi...
research
01/22/2020

On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning

We present a behaviour-based reinforcement learning approach, inspired b...

Please sign up or login with your details

Forgot password? Click here to reset