Accurate Monocular 3D Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving

03/27/2019
by   Xinzhu Ma, et al.
0

In this paper, we propose a monocular 3D object detection framework in the domain of autonomous driving. Unlike previous image-based methods which focus on RGB feature extracted from 2D images, our method solves this problem in the reconstructed 3D space in order to exploit 3D contexts explicitly. To this end, we first leverage a stand-alone module to transform the input data from 2D image plane to 3D point clouds space for a better input representation, then we perform the 3D detection using PointNet backbone net to obtain objects 3D locations, dimensions and orientations. To enhance the discriminative capability of point clouds, we propose a multi-modal feature fusion module to embed the complementary RGB cue into the generated point clouds representation. We argue that it is more effective to infer the 3D bounding boxes from the generated 3D scene space (i.e., X,Y, Z space) compared to the image plane (i.e., R,G,B image plane). Evaluation on the challenging KITTI dataset shows that our approach boosts the performance of state-of-the-art monocular approach by a large margin, i.e., around 15 detection tasks for Car category at 0.7 IoU threshold.

READ FULL TEXT

page 1

page 3

page 8

research
07/13/2020

CenterNet3D:An Anchor free Object Detector for Autonomous Driving

Accurate and fast 3D object detection from point clouds is a key task in...
research
01/29/2020

ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

3D object detection has seen quick progress thanks to advances in deep l...
research
07/16/2022

Consistency of Implicit and Explicit Features Matters for Monocular 3D Object Detection

Monocular 3D object detection is a common solution for low-cost autonomo...
research
03/05/2021

labelCloud: A Lightweight Domain-Independent Labeling Tool for 3D Object Detection in Point Clouds

Within the past decade, the rise of applications based on artificial int...
research
01/08/2019

3D Object Detection Using Scale Invariant and Feature Reweighting Networks

3D object detection plays an important role in a large number of real-wo...
research
01/17/2022

AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection

Object detection through either RGB images or the LiDAR point clouds has...
research
07/08/2021

Multi-Modality Task Cascade for 3D Object Detection

Point clouds and RGB images are naturally complementary modalities for 3...

Please sign up or login with your details

Forgot password? Click here to reset