Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection

02/13/2023
by   Yanjun Liu, et al.
0

Indoor 3D object detection is an essential task in single image scene understanding, impacting spatial cognition fundamentally in visual reasoning. Existing works on 3D object detection from a single image either pursue this goal through independent predictions of each object or implicitly reason over all possible objects, failing to harness relational geometric information between objects. To address this problem, we propose a dynamic sparse graph pipeline named Explicit3D based on object geometry and semantics features. Taking the efficiency into consideration, we further define a relatedness score and design a novel dynamic pruning algorithm followed by a cluster sampling method for sparse scene graph generation and updating. Furthermore, our Explicit3D introduces homogeneous matrices and defines new relative loss and corner loss to model the spatial difference between target pairs explicitly. Instead of using ground-truth labels as direct supervision, our relative and corner loss are derived from the homogeneous transformation, which renders the model to learn the geometric consistency between objects. The experimental results on the SUN RGB-D dataset demonstrate that our Explicit3D achieves better performance balance than the-state-of-the-art.

READ FULL TEXT

page 1

page 4

page 5

page 7

page 10

page 11

research
05/05/2023

DSPDet3D: Dynamic Spatial Pruning for 3D Small Object Detection

In this paper, we propose a new detection framework for 3D small object ...
research
06/30/2018

Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships

Context is important for accurate visual recognition. In this work we pr...
research
12/09/2015

Window-Object Relationship Guided Representation Learning for Generic Object Detections

In existing works that learn representation for object detection, the re...
research
04/05/2021

Non-Homogeneous Haze Removal via Artificial Scene Prior and Bidimensional Graph Reasoning

Due to the lack of natural scene and haze prior information, it is great...
research
11/12/2015

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images

In this paper, we study the challenging problem of predicting the dynami...
research
02/22/2020

Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

Dense indoor scene modeling from 2D images has been bottlenecked due to ...
research
07/30/2020

Perceiving 3D Human-Object Spatial Arrangements from a Single Image in the Wild

We present a method that infers spatial arrangements and shapes of human...

Please sign up or login with your details

Forgot password? Click here to reset