Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

06/08/2022
by   Longlong Jing, et al.
6

Monocular image-based 3D perception has become an active research area in recent years owing to its applications in autonomous driving. Approaches to monocular 3D perception including detection and tracking, however, often yield inferior performance when compared to LiDAR-based techniques. Through systematic analysis, we identified that per-object depth estimation accuracy is a major factor bounding the performance. Motivated by this observation, we propose a multi-level fusion method that combines different representations (RGB and pseudo-LiDAR) and temporal information across multiple frames for objects (tracklets) to enhance per-object depth estimation. Our proposed fusion method achieves the state-of-the-art performance of per-object depth estimation on the Waymo Open Dataset, the KITTI detection dataset, and the KITTI MOT dataset. We further demonstrate that by simply replacing estimated depth with fusion-enhanced depth, we can achieve significant improvements in monocular 3D perception tasks, including detection and tracking.

READ FULL TEXT

page 1

page 3

page 6

research
12/18/2018

Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

3D object detection is an essential task in autonomous driving. Recent t...
research
07/28/2021

Pseudo-LiDAR Based Road Detection

Road detection is a critically important task for self-driving cars. By ...
research
01/09/2023

Deep Planar Parallax for Monocular Depth Estimation

Depth estimation is a fundamental problem in the perception system of au...
research
05/24/2023

Polarimetric Imaging for Perception

Autonomous driving and advanced driver-assistance systems rely on a set ...
research
04/24/2022

RealNet: Combining Optimized Object Detection with Information Fusion Depth Estimation Co-Design Method on IoT

Depth Estimation and Object Detection Recognition play an important role...
research
12/15/2020

Detecting Invisible People

Monocular object detection and tracking have improved drastically in rec...
research
08/25/2021

Monocular Depth Estimation Primed by Salient Point Detection and Normalized Hessian Loss

Deep neural networks have recently thrived on single image depth estimat...

Please sign up or login with your details

Forgot password? Click here to reset