Voxel-MAE: Masked Autoencoders for Pre-training Large-scale Point Clouds

06/20/2022
by   Chen Min, et al.
0

Mask-based pre-training has achieved great success for self-supervised learning in image, video, and language, without manually annotated supervision. However, it has not yet been studied about large-scale point clouds with redundant spatial information in autonomous driving. As the number of large-scale point clouds is huge, it is impossible to reconstruct the input point clouds. In this paper, we propose a mask voxel classification network for large-scale point clouds pre-training. Our key idea is to divide the point clouds into voxel representations and classify whether the voxel contains point clouds. This simple strategy makes the network to be voxel-aware of the object shape, thus improving the performance of the downstream tasks, such as 3D object detection. Our Voxel-MAE with even a 90 representative features for the high spatial redundancy of large-scale point clouds. We also validate the effectiveness of Voxel-MAE in unsupervised domain adaptative tasks, which proves the generalization ability of Voxel-MAE. Our Voxel-MAE proves that it is feasible to pre-train large-scale point clouds without data annotations to enhance the perception ability of the autonomous vehicle. Extensive experiments show great effectiveness of our pre-trained model with 3D object detectors (SECOND, CenterPoint, and PV-RCNN) on three popular datasets (KITTI, Waymo, and nuScenes). Codes are publicly available at https://github.com/chaytonmin/Voxel-MAE.

READ FULL TEXT
research
07/01/2022

Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds

Masked autoencoding has become a successful pre-training paradigm for Tr...
research
07/28/2023

VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation

Conditional 3D generation is undergoing a significant advancement, enabl...
research
03/19/2022

Representation-Agnostic Shape Fields

3D shape analysis has been widely explored in the era of deep learning. ...
research
12/06/2022

GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds

Despite the tremendous progress of Masked Autoencoders (MAE) in developi...
research
12/14/2022

MAELi – Masked Autoencoder for Large-Scale LiDAR Point Clouds

We show how the inherent, but often neglected, properties of large-scale...
research
07/28/2021

Multi Point-Voxel Convolution (MPVConv) for Deep Learning on Point Clouds

The existing 3D deep learning methods adopt either individual point-base...
research
05/23/2023

Hierarchical Adaptive Voxel-guided Sampling for Real-time Applications in Large-scale Point Clouds

While point-based neural architectures have demonstrated their efficacy,...

Please sign up or login with your details

Forgot password? Click here to reset