Delving into the Pre-training Paradigm of Monocular 3D Object Detection

06/08/2022
by   Zhuoling Li, et al.
0

The labels of monocular 3D object detection (M3OD) are expensive to obtain. Meanwhile, there usually exists numerous unlabeled data in practical applications, and pre-training is an efficient way of exploiting the knowledge in unlabeled data. However, the pre-training paradigm for M3OD is hardly studied. We aim to bridge this gap in this work. To this end, we first draw two observations: (1) The guideline of devising pre-training tasks is imitating the representation of the target task. (2) Combining depth estimation and 2D object detection is a promising M3OD pre-training baseline. Afterwards, following the guideline, we propose several strategies to further improve this baseline, which mainly include target guided semi-dense depth estimation, keypoint-aware 2D object detection, and class-level loss adjustment. Combining all the developed techniques, the obtained pre-training framework produces pre-trained backbones that improve M3OD performance significantly on both the KITTI-3D and nuScenes benchmarks. For example, by applying a DLA34 backbone to a naive center-based M3OD detector, the moderate AP_3D70 score of Car on the KITTI-3D testing set is boosted by 18.71% and the NDS score on the nuScenes validation set is improved by 40.41% relatively.

READ FULL TEXT

page 4

page 13

page 14

research
08/13/2021

Is Pseudo-Lidar needed for Monocular 3D Object detection?

Recent progress in 3D object detection from single images leverages mono...
research
04/25/2020

Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection

In this paper, we propose a general and efficient pre-training paradigm,...
research
03/26/2022

Does Monocular Depth Estimation Provide Better Pre-training than Classification for Semantic Segmentation?

Training a deep neural network for semantic segmentation is labor-intens...
research
06/07/2020

CubifAE-3D: Monocular Camera Space Cubification on Autonomous Vehicles for Auto-Encoder based 3D Object Detection

We introduce a method for 3D object detection using a single monocular i...
research
03/30/2021

DAP: Detection-Aware Pre-training with Weak Supervision

This paper presents a detection-aware pre-training (DAP) approach, which...
research
05/19/2022

Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection

As an inherently ill-posed problem, depth estimation from single images ...
research
11/30/2020

Monocular 3D Object Detection with Sequential Feature Association and Depth Hint Augmentation

Monocular 3D object detection is a promising research topic for the inte...

Please sign up or login with your details

Forgot password? Click here to reset