Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation

02/05/2020
by   Yingjie Cai, et al.
23

Monocular 3D object detection task aims to predict the 3D bounding boxes of objects based on monocular RGB images. Since the location recovery in 3D space is quite difficult on account of absence of depth information, this paper proposes a novel unified framework which decomposes the detection problem into a structured polygon prediction task and a depth recovery task. Different from the widely studied 2D bounding boxes, the proposed novel structured polygon in the 2D image consists of several projected surfaces of the target object. Compared to the widely-used 3D bounding box proposals, it is shown to be a better representation for 3D detection. In order to inversely project the predicted 2D structured polygon to a cuboid in the 3D physical world, the following depth recovery task uses the object height prior to complete the inverse projection transformation with the given camera projection matrix. Moreover, a fine-grained 3D box refinement scheme is proposed to further rectify the 3D detection results. Experiments are conducted on the challenging KITTI benchmark, in which our method achieves state-of-the-art detection accuracy.

READ FULL TEXT

page 1

page 3

page 4

page 7

research
04/18/2021

MonoGRNet: A General Framework for Monocular 3D Object Detection

Detecting and localizing objects in the real 3D space, which plays a cru...
research
07/20/2020

Object-Aware Centroid Voting for Monocular 3D Object Detection

Monocular 3D object detection aims to detect objects in a 3D physical wo...
research
04/08/2021

Geometry-based Distance Decomposition for Monocular 3D Object Detection

Monocular 3D object detection is of great significance for autonomous dr...
research
10/13/2021

DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries

We introduce a framework for multi-camera 3D object detection. In contra...
research
12/09/2021

Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection

Monocular 3D object detection aims to localize 3D bounding boxes in an i...
research
09/13/2023

Polygon Intersection-over-Union Loss for Viewpoint-Agnostic Monocular 3D Vehicle Detection

Monocular 3D object detection is a challenging task because depth inform...
research
08/02/2018

Object Localization and Size Estimation from RGB-D Images

Depth sensing cameras (e.g., Kinect sensor, Tango phone) can acquire col...

Please sign up or login with your details

Forgot password? Click here to reset