Disentangling Monocular 3D Object Detection

05/29/2019
by   Andrea Simonelli, et al.
0

In this paper we propose an approach for monocular 3D object detection from a single RGB image, which leverages a novel disentangling transformation for 2D and 3D detection losses and a novel, self-supervised confidence score for 3D bounding boxes. Our proposed loss disentanglement has the twofold advantage of simplifying the training dynamics in the presence of losses with complex interactions of parameters, and sidestepping the issue of balancing independent regression terms. Our solution overcomes these issues by isolating the contribution made by groups of parameters to a given loss, without changing its nature. We further apply loss disentanglement to another novel, signed Intersection-over-Union criterion-driven loss for improving 2D detection results. Besides our methodological innovations, we critically review the AP metric used in KITTI3D, which emerged as the most important dataset for comparing 3D detection results. We identify and resolve a flaw in the 11-point interpolated AP metric, affecting all previously published detection results and particularly biases the results of monocular 3D detection. We provide extensive experimental evaluations and ablation studies on the KITTI3D and nuScenes datasets, setting new state-of-the-art results on object category car by large margins.

READ FULL TEXT

page 1

page 3

page 6

page 10

page 11

page 14

page 15

research
05/14/2019

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Monocular 3D object detection is well-known to be a challenging vision t...
research
06/23/2020

Single-Shot 3D Detection of Vehicles from Monocular RGB Images via Geometry Constrained Keypoints in Real-Time

In this paper we propose a novel 3D single-shot object detection method ...
research
12/06/2018

ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape

We present a deep learning method for end-to-end monocular 3D object det...
research
06/10/2021

Gaussian Bounding Boxes and Probabilistic Intersection-over-Union for Object Detection

Most object detection methods use bounding boxes to encode and represent...
research
07/19/2022

Rethinking IoU-based Optimization for Single-stage 3D Object Detection

Since Intersection-over-Union (IoU) based optimization maintains the con...
research
04/02/2019

Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction

We present MonoPSR, a monocular 3D object detection method that leverage...
research
07/17/2020

Improving Object Detection with Selective Self-supervised Self-training

We study how to leverage Web images to augment human-curated object dete...

Please sign up or login with your details

Forgot password? Click here to reset