GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training

05/15/2023
by   Xiaoyu Tian, et al.

This paper addresses a fundamental question in point cloud self-supervised learning: what is a good signal to leverage for learning features from point clouds without annotations? To answer it, we introduce a point cloud representation learning framework based on geometric feature reconstruction. In contrast to recent papers that directly adopt the masked autoencoder (MAE) and only predict original coordinates or occupancy from masked point clouds, our method revisits the differences between images and point clouds and identifies three self-supervised learning objectives specific to point clouds: centroid prediction, normal estimation, and curvature prediction. Combined with occupancy prediction, these four objectives yield a nontrivial self-supervised learning task and jointly help models reason about the fine-grained geometry of point clouds. Our pipeline is conceptually simple and consists of two major steps: first, it randomly masks out groups of points and passes the remaining points to a Transformer-based point cloud encoder; second, a lightweight Transformer decoder predicts the centroid, normal, and curvature for the points in each voxel. We transfer the pre-trained Transformer encoder to a downstream perception model. On the nuScenes dataset, our model achieves a 3.38 mAP improvement for object detection, a 2.1 mIoU gain for segmentation, and a 1.7 AMOTA gain for multi-object tracking. We also conduct experiments on the Waymo Open Dataset and achieve significant performance improvements over baselines as well.
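To make the three geometric targets concrete, the sketch below computes a centroid, a normal, and a curvature value for the points in one voxel, together with a random voxel mask. The abstract does not say how the targets are computed, so this uses a common PCA-based estimate as an assumption: the normal is the covariance eigenvector with the smallest eigenvalue, and the curvature is the "surface variation" ratio of that eigenvalue to the eigenvalue sum. The function names, `voxel_size`, and `mask_ratio` are hypothetical and are not taken from the paper.

```python
import numpy as np

def voxelize(points, voxel_size):
    """Assign each point to a cubic voxel; returns unique voxel keys and per-point voxel ids."""
    keys = np.floor(np.asarray(points) / voxel_size).astype(np.int64)
    uniq, inverse = np.unique(keys, axis=0, return_inverse=True)
    return uniq, inverse.reshape(-1)  # reshape keeps ids 1-D across NumPy versions

def random_mask(num_voxels, mask_ratio, rng):
    """Boolean mask with round(mask_ratio * num_voxels) voxels marked as masked."""
    num_masked = int(round(mask_ratio * num_voxels))
    mask = np.zeros(num_voxels, dtype=bool)
    mask[rng.permutation(num_voxels)[:num_masked]] = True
    return mask

def geometric_targets(points):
    """PCA-based centroid, normal, and curvature for an (N, 3) group of points, N >= 3.

    Assumption: this is a standard estimate, not necessarily the paper's definition.
    """
    points = np.asarray(points, dtype=np.float64)
    centroid = points.mean(axis=0)
    centered = points - centroid
    cov = centered.T @ centered / len(points)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    normal = eigvecs[:, 0]                  # direction of least variance (sign is ambiguous)
    total = eigvals.sum()
    curvature = eigvals[0] / total if total > 0 else 0.0
    return centroid, normal, curvature
```

In a pre-training loop of this kind, the targets would be computed from the full point cloud for every masked voxel, and the decoder outputs would be regressed against them. Because the sign of a PCA normal is arbitrary, a loss on normals would typically need to be sign-invariant or use a consistent orientation convention.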

research
03/29/2023

Point2Vec for Self-Supervised Representation Learning on Point Clouds

Recently, the self-supervised learning framework data2vec has shown insp...
research
09/13/2022

SeRP: Self-Supervised Representation Learning Using Perturbed Point Clouds

We present SeRP, a framework for Self-Supervised Learning of 3D point cl...
research
01/21/2023

Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

Precise localization is critical for autonomous vehicles. We present a s...
research
03/23/2023

PointGame: Geometrically and Adaptively Masked Auto-Encoder on Point Clouds

Self-supervised learning is attracting large attention in point cloud un...
research
07/28/2023

Point Clouds Are Specialized Images: A Knowledge Transfer Approach for 3D Understanding

Self-supervised representation learning (SSRL) has gained increasing att...
research
07/06/2020

Geometric Attention for Prediction of Differential Properties in 3D Point Clouds

Estimation of differential geometric quantities in discrete 3D data repr...
research
01/03/2022

Implicit Autoencoder for Point Cloud Self-supervised Representation Learning

Many 3D representations (e.g., point clouds) are discrete samples of the...
