3DP3: 3D Scene Perception via Probabilistic Programming

10/30/2021
by   Nishad Gothoskar, et al.
7

We present 3DP3, a framework for inverse graphics that uses inference in a structured generative model of objects, scenes, and images. 3DP3 uses (i) voxel models to represent the 3D shape of objects, (ii) hierarchical scene graphs to decompose scenes into objects and the contacts between them, and (iii) depth image likelihoods based on real-time graphics. Given an observed RGB-D image, 3DP3's inference algorithm infers the underlying latent 3D scene, including the object poses and a parsimonious joint parametrization of these poses, using fast bottom-up pose proposals, novel involutive MCMC updates of the scene graph structure, and, optionally, neural object detectors and pose estimators. We show that 3DP3 enables scene understanding that is aware of 3D shape, occlusion, and contact structure. Our results demonstrate that 3DP3 is more accurate at 6DoF object pose estimation from real images than deep learning baselines and shows better generalization to challenging scenes with novel viewpoints, contact, and partial observability.

READ FULL TEXT

page 6

page 8

page 15

page 16

page 17

page 18

page 19

research
02/07/2023

3D Neural Embedding Likelihood for Robust Sim-to-Real Transfer in Inverse Graphics

A central challenge in 3D scene perception via inverse graphics is robus...
research
10/08/2020

Semi-Supervised Learning of Multi-Object 3D Scene Representations

Representing scenes at the granularity of objects is a prerequisite for ...
research
07/04/2014

Inverse Graphics with Probabilistic CAD Models

Recently, multiple formulations of vision problems as probabilistic inve...
research
02/07/2023

Structured Generative Models for Scene Understanding

This position paper argues for the use of structured generative models (...
research
10/08/2019

Refining 6D Object Pose Predictions using Abstract Render-and-Compare

Robotic systems often require precise scene analysis capabilities, espec...
research
04/09/2020

MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion

Robots and other smart devices need efficient object-based scene represe...
research
02/19/2020

Table-Top Scene Analysis Using Knowledge-Supervised MCMC

In this paper, we propose a probabilistic method to generate abstract sc...

Please sign up or login with your details

Forgot password? Click here to reset