On the Efficacy of 3D Point Cloud Reinforcement Learning

06/11/2023
by Zhan Ling, et al.
Recent studies on visual reinforcement learning (visual RL) have explored the use of 3D visual representations. However, none of these works has systematically compared the efficacy of 3D representations with 2D representations across different tasks, nor have they analyzed 3D representations from the perspective of agent-object / object-object relationship reasoning. In this work, we seek answers to the question of when and how 3D neural networks that learn features in the 3D-native space provide a beneficial inductive bias for visual RL. We specifically focus on 3D point clouds, one of the most common forms of 3D representations. We systematically investigate design choices for 3D point cloud RL, leading to the development of a robust algorithm for various robotic manipulation and control tasks. Furthermore, through comparisons between 2D image and 3D point cloud RL methods on both minimalist synthetic tasks and complex robotic manipulation tasks, we find that 3D point cloud RL can significantly outperform its 2D counterpart when agent-object / object-object relationship encoding is a key factor.

