M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

07/13/2019
by   Garrick Brazil, et al.
1

Understanding the world in 3D is a critical component of urban autonomous driving. Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been paramount for successful 3D object detection algorithms, whereas monocular image-only methods experience drastically reduced performance. We propose to reduce the gap by reformulating the monocular 3D detection problem as a standalone 3D region proposal network. We leverage the geometric relationship of 2D and 3D perspectives, allowing 3D boxes to utilize well-known and powerful convolutional features generated in the image-space. To help address the strenuous 3D parameter estimations, we further design depth-aware convolutional layers which enable location specific feature development and in consequence improved 3D scene understanding. Compared to prior work in monocular 3D detection, our method consists of only the proposed 3D region proposal network rather than relying on external networks, data, or multiple stages. M3D-RPN is able to significantly improve the performance of both monocular 3D Object Detection and Bird's Eye View tasks within the KITTI urban autonomous driving dataset, while efficiently using a shared multi-class model.

READ FULL TEXT

page 1

page 4

page 8

research
02/01/2021

Ground-aware Monocular 3D Object Detection for Autonomous Driving

Estimating the 3D position and orientation of objects in the environment...
research
11/21/2019

RefinedMPL: Refined Monocular PseudoLiDAR for 3D Object Detection in Autonomous Driving

In this paper, we strive for solving the ambiguities arisen by the astou...
research
07/19/2020

Kinematic 3D Object Detection in Monocular Video

Perceiving the physical world in 3D is fundamental for self-driving appl...
research
08/24/2023

Perspective-aware Convolution for Monocular 3D Object Detection

Monocular 3D object detection is a crucial and challenging task for auto...
research
12/03/2021

SGM3D: Stereo Guided Monocular 3D Object Detection

Monocular 3D object detection is a critical yet challenging task for aut...
research
05/16/2019

Monocular Plan View Networks for Autonomous Driving

Convolutions on monocular dash cam videos capture spatial invariances in...
research
08/08/2022

Aerial Monocular 3D Object Detection

Drones equipped with cameras can significantly enhance human ability to ...

Please sign up or login with your details

Forgot password? Click here to reset