Single Image 3D Object Estimation with Primitive Graph Networks

09/09/2021
by   Qian He, et al.
11

Reconstructing 3D object from a single image (RGB or depth) is a fundamental problem in visual scene understanding and yet remains challenging due to its ill-posed nature and complexity in real-world scenes. To address those challenges, we adopt a primitive-based representation for 3D object, and propose a two-stage graph network for primitive-based 3D object estimation, which consists of a sequential proposal module and a graph reasoning module. Given a 2D image, our proposal module first generates a sequence of 3D primitives from input image with local feature attention. Then the graph reasoning module performs joint reasoning on a primitive graph to capture the global shape context for each primitive. Such a framework is capable of taking into account rich geometry and semantic constraints during 3D structure recovery, producing 3D objects with more coherent structure even under challenging viewing conditions. We train the entire graph neural network in a stage-wise strategy and evaluate it on three benchmarks: Pix3D, ModelNet and NYU Depth V2. Extensive experiments show that our approach outperforms the previous state of the arts with a considerable margin.

READ FULL TEXT

page 8

page 11

research
03/11/2021

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a sin...
research
08/20/2022

Learning Primitive-aware Discriminative Representations for FSL

Few-shot learning (FSL) aims to learn a classifier that can be easily ad...
research
03/27/2021

SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences

Scene graphs are a compact and explicit representation successfully used...
research
08/25/2022

Learning Continuous Implicit Representation for Near-Periodic Patterns

Near-Periodic Patterns (NPP) are ubiquitous in man-made scenes and are c...
research
05/05/2021

Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images

Humans perceive and construct the surrounding world as an arrangement of...
research
07/09/2023

Convex Decomposition of Indoor Scenes

We describe a method to parse a complex, cluttered indoor scene into pri...
research
10/21/2020

Neural Star Domain as Primitive Representation

Reconstructing 3D objects from 2D images is a fundamental task in comput...

Please sign up or login with your details

Forgot password? Click here to reset