Learning a Disentangled Embedding for Monocular 3D Shape Retrieval and Pose Estimation

12/24/2018
by   Kyaw Zaw Lin, et al.
0

We propose a novel approach to jointly perform 3D object retrieval and pose estimation from monocular images.In order to make the method robust to real world scene variations in the images, e.g. texture, lighting and background,we learn an embedding space from 3D data that only includes the relevant information, namely the shape and pose.Our method can then be trained for robustness under real world scene variations without having to render a large training set simulating these variations. Our learned embedding explicitly disentangles a shape vector and a pose vector, which alleviates both pose bias for 3D shape retrieval and categorical bias for pose estimation. Having the learned disentangled embedding, we train a CNN to map the images to the embedding space, and then retrieve the closest 3D shape from the database and estimate the 6D pose of the object using the embedding vectors. Our method achieves 10.8 median error for pose estimation and 0.514 top-1-accuracy for category agnostic 3D object retrieval on the Pascal3D+ dataset. It therefore outperforms the previous state-of-the-art methods on both tasks.

READ FULL TEXT
research
07/27/2021

Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

6D pose estimation of rigid objects from a single RGB image has seen tre...
research
09/11/2023

ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion

In this letter, we introduce ViHOPE, a novel framework for estimating th...
research
02/07/2023

3D Neural Embedding Likelihood for Robust Sim-to-Real Transfer in Inverse Graphics

A central challenge in 3D scene perception via inverse graphics is robus...
research
08/12/2022

Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation

Pose estimation is usually tackled as either a bin classification proble...
research
04/17/2023

Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose Estimation

In recent years, there has been a growing trend of using data-driven met...
research
03/29/2016

Learning a Predictable and Generative Vector Representation for Objects

What is a good vector representation of an object? We believe that it sh...
research
03/12/2020

CPS: Class-level 6D Pose and Shape Estimation From Monocular Images

Contemporary monocular 6D pose estimation methods can only cope with a h...

Please sign up or login with your details

Forgot password? Click here to reset