3D Neural Embedding Likelihood for Robust Sim-to-Real Transfer in Inverse Graphics

02/07/2023
by Guangyao Zhou, et al.

A central challenge in 3D scene perception via inverse graphics is robustly modeling the gap between 3D graphics and real-world data. We propose a novel 3D Neural Embedding Likelihood (3DNEL) over RGB-D images to address this gap. 3DNEL uses neural embeddings to predict 2D-3D correspondences from RGB and combines these correspondences with depth in a principled manner. 3DNEL is trained entirely on synthetic images and generalizes to real-world data. To showcase this capability, we develop a multi-stage inverse graphics pipeline that uses 3DNEL for 6D object pose estimation from real RGB-D images. Our method outperforms the previous state of the art in sim-to-real pose estimation on the YCB-Video dataset and improves robustness, producing significantly fewer large-error predictions. Unlike existing bottom-up, discriminative approaches that are specialized for pose estimation, 3DNEL adopts a probabilistic generative formulation that jointly models multi-object scenes. This generative formulation makes it easy to extend 3DNEL to additional tasks, such as object and camera tracking from video, using principled inference in the same probabilistic model without task-specific retraining.

research
10/30/2021

3DP3: 3D Scene Perception via Probabilistic Programming

We present 3DP3, a framework for inverse graphics that uses inference in...

research
12/24/2018

Learning a Disentangled Embedding for Monocular 3D Shape Retrieval and Pose Estimation

We propose a novel approach to jointly perform 3D object retrieval and p...

research
11/29/2014

3D Hand Pose Detection in Egocentric RGB-D Images

We focus on the task of everyday hand pose estimation from egocentric vi...

research
01/15/2019

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion

A key technical challenge in performing 6D object pose estimation from R...

research
09/19/2023

RGB-based Category-level Object Pose Estimation via Decoupled Metric Scale Recovery

While showing promising results, recent RGB-D camera-based category-leve...

research
06/29/2013

Approximate Bayesian Image Interpretation using Generative Probabilistic Graphics Programs

The idea of computer vision as the Bayesian inverse problem to computer ...

research
02/04/2019

Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

We propose a real-time RGB-based pipeline for object detection and 6D po...