Cross-Domain 3D Equivariant Image Embeddings

12/06/2018
by   Carlos Esteves, et al.
14

Spherical convolutional networks have been introduced recently as tools to learn powerful feature representations of 3D shapes. Spherical CNNs are equivariant to 3D rotations making them ideally suited for applications where 3D data may be observed in arbitrary orientations. In this paper we learn 2D image embeddings with a similar equivariant structure: embedding the image of a 3D object should commute with rotations of the object. We introduce a cross-domain embedding from 2D images into a spherical CNN latent space. Our model is supervised only by target embeddings obtained from a spherical CNN pretrained for 3D shape classification. The trained model learns to encode images with 3D shape properties and is equivariant to 3D rotations of the observed object. We show that learning only a rich embedding for images with appropriate geometric structure is in and of itself sufficient for tackling numerous applications. Evidence from two different applications, relative pose estimation and novel view synthesis, demonstrates that equivariant embeddings are sufficient for both applications without requiring any task-specific supervised training.

READ FULL TEXT

page 10

page 11

research
12/04/2020

Learning Equivariant Representations

State-of-the-art deep learning systems often require large amounts of da...
research
10/12/2020

Spherical Convolutional Neural Networks: Stability to Perturbations in SO(3)

Spherical signals are useful mathematical models for data arising in man...
research
12/25/2020

Three-dimensional Simultaneous Shape and Pose Estimation for Extended Objects Using Spherical Harmonics

We propose a new recursive method for simultaneous estimation of both th...
research
08/11/2019

Cross-Domain Collaborative Filtering via Translation-based Learning

With the proliferation of social media platforms and e-commerce sites, s...
research
01/01/2022

SurfGen: Adversarial 3D Shape Synthesis with Explicit Surface Discriminators

Recent advances in deep generative models have led to immense progress i...
research
08/05/2022

Disentangling 3D Attributes from a Single 2D Image: Human Pose, Shape and Garment

For visual manipulation tasks, we aim to represent image content with se...
research
10/11/2019

Relation learning in a neurocomputational architecture supports cross-domain transfer

People readily generalise prior knowledge to novel situations and stimul...

Please sign up or login with your details

Forgot password? Click here to reset