Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds

09/01/2021
by   Siyuan Huang, et al.
0

To date, various 3D scene understanding tasks still lack practical and generalizable pre-trained models, primarily due to the intricate nature of 3D scene understanding tasks and their immense variations introduced by camera views, lighting, occlusions, etc. In this paper, we tackle this challenge by introducing a spatio-temporal representation learning (STRL) framework, capable of learning from unlabeled 3D point clouds in a self-supervised fashion. Inspired by how infants learn from visual data in the wild, we explore the rich spatio-temporal cues derived from the 3D data. Specifically, STRL takes two temporally-correlated frames from a 3D point cloud sequence as the input, transforms it with the spatial data augmentation, and learns the invariant representation self-supervisedly. To corroborate the efficacy of STRL, we conduct extensive experiments on three types (synthetic, indoor, and outdoor) of datasets. Experimental results demonstrate that, compared with supervised learning methods, the learned self-supervised representation facilitates various models to attain comparable or even better performances while capable of generalizing pre-trained models to downstream tasks, including 3D shape classification, 3D object detection, and 3D semantic segmentation. Moreover, the spatio-temporal contextual cues embedded in 3D point clouds significantly improve the learned representations.

READ FULL TEXT

page 1

page 5

research
01/09/2022

Self-Supervised Feature Learning from Partial Point Clouds via Pose Disentanglement

Self-supervised learning on point clouds has gained a lot of attention r...
research
11/02/2022

Joint Data and Feature Augmentation for Self-Supervised Representation Learning on Point Clouds

To deal with the exhausting annotations, self-supervised representation ...
research
08/18/2023

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos

Recently, the community has made tremendous progress in developing effec...
research
07/29/2020

Whole MILC: generalizing learned dynamics across tasks, datasets, and populations

Behavioral changes are the earliest signs of a mental disorder, but argu...
research
08/18/2023

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos

We propose a unified point cloud video self-supervised learning framewor...
research
03/30/2021

Learning Parallel Dense Correspondence from Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction

This paper focuses on the task of 4D shape reconstruction from a sequenc...
research
12/06/2022

Objects as Spatio-Temporal 2.5D points

Determining accurate bird's eye view (BEV) positions of objects and trac...

Please sign up or login with your details

Forgot password? Click here to reset