ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes

01/19/2022
by   Rahul Sajnani, et al.
5

Progress in 3D object understanding has relied on manually canonicalized shape datasets that contain instances with consistent position and orientation (3D pose). This has made it hard to generalize these methods to in-the-wild shapes, eg., from internet model collections or depth sensors. ConDor is a self-supervised method that learns to Canonicalize the 3D orientation and position for full and partial 3D point clouds. We build on top of Tensor Field Networks (TFNs), a class of permutation- and rotation-equivariant, and translation-invariant 3D networks. During inference, our method takes an unseen full or partial 3D point cloud at an arbitrary pose and outputs an equivariant canonical pose. During training, this network uses self-supervision losses to learn the canonical pose from an un-canonicalized collection of full and partial 3D point clouds. ConDor can also learn to consistently co-segment object parts without any supervision. Extensive quantitative results on four new metrics show that our approach outperforms existing methods while enabling new applications such as operation on depth images and annotation transfer.

READ FULL TEXT

page 4

page 8

page 14

page 16

page 17

research
01/09/2022

Self-Supervised Feature Learning from Partial Point Clouds via Pose Disentanglement

Self-supervised learning on point clouds has gained a lot of attention r...
research
12/05/2022

Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields

Coordinate-based implicit neural networks, or neural fields, have emerge...
research
02/01/2021

Adjoint Rigid Transform Network: Self-supervised Alignment of 3D Shapes

Most learning methods for 3D data (point clouds, meshes) suffer signific...
research
11/06/2020

Learning to Orient Surfaces by Self-supervised Spherical CNNs

Defining and reliably finding a canonical orientation for 3D surfaces is...
research
02/22/2018

Tensor Field Networks: Rotation- and Translation-Equivariant Neural Networks for 3D Point Clouds

We introduce tensor field networks, which are locally equivariant to 3D ...
research
04/03/2022

Shape-Pose Disentanglement using SE(3)-equivariant Vector Neurons

We introduce an unsupervised technique for encoding point clouds into a ...
research
08/06/2020

CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations

We propose CaSPR, a method to learn object-centric canonical spatiotempo...

Please sign up or login with your details

Forgot password? Click here to reset