MonoDistill: Learning Spatial Features for Monocular 3D Object Detection

01/26/2022
by   Zhiyu Chong, et al.
7

3D object detection is a fundamental and challenging task for 3D scene understanding, and the monocular-based methods can serve as an economical alternative to the stereo-based or LiDAR-based methods. However, accurately detecting objects in the 3D space from a single image is extremely difficult due to the lack of spatial cues. To mitigate this issue, we propose a simple and effective scheme to introduce the spatial information from LiDAR signals to the monocular 3D detectors, without introducing any extra cost in the inference phase. In particular, we first project the LiDAR signals into the image plane and align them with the RGB images. After that, we use the resulting data to train a 3D detector (LiDAR Net) with the same architecture as the baseline model. Finally, this LiDAR Net can serve as the teacher to transfer the learned knowledge to the baseline model. Experimental results show that the proposed method can significantly boost the performance of the baseline model and ranks the 1^st place among all monocular-based methods on the KITTI benchmark. Besides, extensive ablation studies are conducted, which further prove the effectiveness of each part of our designs and illustrate what the baseline model has learned from the LiDAR Net. Our code will be released at <https://github.com/monster-ghost/MonoDistill>.

READ FULL TEXT

page 3

page 4

page 6

page 9

page 16

page 17

research
11/17/2022

BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection

3D object detection from multiple image views is a fundamental and chall...
research
04/19/2021

Lidar Point Cloud Guided Monocular 3D Object Detection

Monocular 3D object detection is drawing increasing attention from the c...
research
03/04/2022

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

Pseudo-LiDAR 3D detectors have made remarkable progress in monocular 3D ...
research
07/17/2023

Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning

We propose a novel semi-supervised active learning (SSAL) framework for ...
research
08/11/2020

Rethinking Pseudo-LiDAR Representation

The recently proposed pseudo-LiDAR based 3D detectors greatly improve th...
research
12/10/2020

Demystifying Pseudo-LiDAR for Monocular 3D Object Detection

Pseudo-LiDAR-based methods for monocular 3D object detection have genera...
research
11/30/2022

Attention-based Depth Distillation with 3D-Aware Positional Encoding for Monocular 3D Object Detection

Monocular 3D object detection is a low-cost but challenging task, as it ...

Please sign up or login with your details

Forgot password? Click here to reset