Holistic 3D Scene Understanding from a Single Image with Implicit Representation

03/11/2021
by   Cheng Zhang, et al.
47

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shape, object pose, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate estimation of both shapes and layout especially for the cluttered scene due to the heavy occlusion between objects. We propose to utilize the latest deep implicit representation to solve this challenge. We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features. A novel physical violation loss is also proposed to avoid incorrect context between objects. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods in terms of object shape, scene layout estimation, and 3D object detection.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

research
12/05/2017

Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene

The goal of this paper is to take a single 2D image of a scene and recov...
research
06/06/2021

Neural Implicit 3D Shapes from Single Images with Spatial Patterns

3D shape reconstruction from a single image has been a long-standing pro...
research
03/16/2016

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

While deep neural networks have led to human-level performance on comput...
research
09/09/2021

Single Image 3D Object Estimation with Primitive Graph Networks

Reconstructing 3D object from a single image (RGB or depth) is a fundame...
research
12/09/2019

Learning a Layout Transfer Network for Context Aware Object Detection

We present a context aware object detection method based on a retrieve-a...
research
08/25/2022

Learning Continuous Implicit Representation for Near-Periodic Patterns

Near-Periodic Patterns (NPP) are ubiquitous in man-made scenes and are c...
research
08/22/2023

Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views

Hand-object interaction understanding and the barely addressed novel vie...

Please sign up or login with your details

Forgot password? Click here to reset