The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection

12/28/2021
by   Zhikang Zou, et al.
12

Low-cost monocular 3D object detection plays a fundamental role in autonomous driving, whereas its accuracy is still far from satisfactory. In this paper, we dig into the 3D object detection task and reformulate it as the sub-tasks of object localization and appearance perception, which benefits to a deep excavation of reciprocal information underlying the entire task. We introduce a Dynamic Feature Reflecting Network, named DFR-Net, which contains two novel standalone modules: (i) the Appearance-Localization Feature Reflecting module (ALFR) that first separates taskspecific features and then self-mutually reflects the reciprocal features; (ii) the Dynamic Intra-Trading module (DIT) that adaptively realigns the training processes of various sub-tasks via a self-learning manner. Extensive experiments on the challenging KITTI dataset demonstrate the effectiveness and generalization of DFR-Net. We rank 1st among all the monocular 3D object detectors in the KITTI test set (till March 16th, 2021). The proposed method is also easy to be plug-and-play in many cutting-edge 3D detection frameworks at negligible cost to boost performance. The code will be made publicly available.

READ FULL TEXT

page 3

page 7

research
07/19/2020

Kinematic 3D Object Detection in Monocular Video

Perceiving the physical world in 3D is fundamental for self-driving appl...
research
02/01/2021

Ground-aware Monocular 3D Object Detection for Autonomous Driving

Estimating the 3D position and orientation of objects in the environment...
research
03/02/2023

Task-Specific Context Decoupling for Object Detection

Classification and localization are two main sub-tasks in object detecti...
research
12/03/2021

SGM3D: Stereo Guided Monocular 3D Object Detection

Monocular 3D object detection is a critical yet challenging task for aut...
research
03/30/2021

Delving into Localization Errors for Monocular 3D Object Detection

Estimating 3D bounding boxes from monocular images is an essential compo...
research
11/21/2022

Simultaneous Multiple Object Detection and Pose Estimation using 3D Model Infusion with Monocular Vision

Multiple object detection and pose estimation are vital computer vision ...

Please sign up or login with your details

Forgot password? Click here to reset