Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds

10/14/2022
by Minghua Liu, et al.

We study how the choice of input coordinate frame affects the learning of manipulation skills from 3D point clouds. Captured robot-object-interaction point clouds can be normalized in a variety of coordinate frames. We find that the choice of frame has a profound effect on agent learning performance, and that the trend is similar across 3D backbone networks. In particular, the end-effector frame and the target-part frame achieve higher training efficiency than the commonly used world frame and robot-base frame in many tasks, intuitively because they align point clouds across time steps and thus simplify learning for the visual module. Moreover, the well-performing frames vary across tasks, and some tasks may benefit from multiple candidate frames. We thus propose FrameMiners to adaptively select candidate frames and fuse their merits in a task-agnostic manner. Experimentally, FrameMiners achieves on-par or significantly higher performance than the best single-frame version on five fully physical manipulation tasks adapted from ManiSkill and OCRTOC. Because it requires neither changing existing camera placements nor adding extra cameras, point cloud frame mining can serve as a free lunch to improve 3D manipulation learning.
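To make the notion of a coordinate frame concrete, the following is a minimal sketch of re-expressing a world-frame point cloud in an end-effector frame. It is not the authors' implementation: the function names `world_to_frame` and `yaw_rotation`, the pose values, and the assumption that the end-effector pose is available as a rotation matrix plus a translation are all hypothetical and chosen for illustration only.

```python
import math


def world_to_frame(points, rotation, translation):
    """Express world-frame points in a local frame (hypothetical helper).

    rotation:    3x3 matrix (list of rows) whose columns are the local
                 frame's axes written in world coordinates.
    translation: origin of the local frame in world coordinates.
    Each point is mapped as p_local = R^T (p_world - t).
    """
    local_points = []
    for p in points:
        # Shift so the local frame's origin becomes the zero vector.
        d = [p[i] - translation[i] for i in range(3)]
        # Multiply by R^T: component j is the dot product with column j of R.
        local_points.append(
            tuple(sum(rotation[k][j] * d[k] for k in range(3)) for j in range(3))
        )
    return local_points


def yaw_rotation(theta):
    """Rotation by theta radians about the world z-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


# Assumed end-effector pose: located at (1, 2, 0.5), yawed by 90 degrees.
ee_rotation = yaw_rotation(math.pi / 2)
ee_translation = (1.0, 2.0, 0.5)

# A tiny stand-in "point cloud" in the world frame.
world_points = [(1.0, 2.0, 0.5), (1.0, 3.0, 0.5)]
ee_points = world_to_frame(world_points, ee_rotation, ee_translation)
```

In this sketch the end-effector's own position maps to the origin regardless of where the arm is in the world, which illustrates the kind of alignment across time steps that the abstract refers to.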


research
12/18/2020

PointINet: Point Cloud Frame Interpolation Network

LiDAR point cloud streams are usually sparse in time dimension, which is...
research
08/12/2020

ASAP-Net: Attention and Structure Aware Point Cloud Sequence Segmentation

Recent works of point clouds show that multi-frame spatio-temporal model...
research
06/11/2023

On the Efficacy of 3D Point Cloud Reinforcement Learning

Recent studies on visual reinforcement learning (visual RL) have explore...
research
03/07/2018

Adapting Everyday Manipulation Skills to Varied Scenarios

We address the problem of executing tool-using manipulation skills in sc...
research
02/23/2022

Let's Handle It: Generalizable Manipulation of Articulated Objects

In this project we present a framework for building generalizable manipu...
research
09/24/2020

Multi-Frame to Single-Frame: Knowledge Distillation for 3D Object Detection

A common dilemma in 3D object detection for autonomous driving is that h...
research
04/23/2019

Graph-based Inpainting for 3D Dynamic Point Clouds

With the development of depth sensors and 3D laser scanning techniques, ...
