Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection

05/19/2022
by   Zhuoling Li, et al.
1

As an inherently ill-posed problem, depth estimation from single images is the most challenging part of monocular 3D object detection (M3OD). Many existing methods rely on preconceived assumptions to bridge the missing spatial information in monocular images, and predict a sole depth value for every object of interest. However, these assumptions do not always hold in practical applications. To tackle this problem, we propose a depth solving system that fully explores the visual clues from the subtasks in M3OD and generates multiple estimations for the depth of each target. Since the depth estimations rely on different assumptions in essence, they present diverse distributions. Even if some assumptions collapse, the estimations established on the remaining assumptions are still reliable. In addition, we develop a depth selection and combination strategy. This strategy is able to remove abnormal estimations caused by collapsed assumptions, and adaptively combine the remaining estimations into a single one. In this way, our depth solving system becomes more precise and robust. Exploiting the clues from multiple subtasks of M3OD and without introducing any extra information, our method surpasses the current best method by more than 20 the KITTI 3D object detection benchmark, while still maintaining real-time efficiency.

READ FULL TEXT

page 1

page 4

page 8

research
06/15/2022

MonoGround: Detecting Monocular 3D Objects from the Ground

Monocular 3D object detection has attracted great attention for its adva...
research
04/06/2021

Objects are Different: Flexible Monocular 3D Object Detection

The precise localization of 3D objects from a single image without depth...
research
05/27/2020

Center3D: Center-based Monocular 3D Object Detection with Joint Depth Understanding

Localizing objects in 3D space and understanding their associated 3D pro...
research
07/29/2021

Geometry Uncertainty Projection Network for Monocular 3D Object Detection

Geometry Projection is a powerful depth estimation method in monocular 3...
research
02/13/2020

Object Detection on Single Monocular Images through Canonical Correlation Analysis

Without using extra 3-D data like points cloud or depth images for provi...
research
07/29/2021

Probabilistic and Geometric Depth: Detecting Objects in Perspective

3D object detection is an important capability needed in various practic...
research
06/08/2022

Delving into the Pre-training Paradigm of Monocular 3D Object Detection

The labels of monocular 3D object detection (M3OD) are expensive to obta...

Please sign up or login with your details

Forgot password? Click here to reset