Orthographic Feature Transform for Monocular 3D Object Detection

11/20/2018
by Thomas Roddick, et al.

3D object detection from monocular images has proven to be an enormously challenging task, with the performance of leading systems not yet achieving even 10% of that of LiDAR-based counterparts. One explanation for this performance gap is that existing systems are entirely at the mercy of the perspective image-based representation, in which the appearance and scale of objects vary drastically with depth and meaningful distances are difficult to infer. In this work we argue that the ability to reason about the world in 3D is an essential element of the 3D object detection task. To this end, we introduce the orthographic feature transform, which enables us to escape the image domain by mapping image-based features into an orthographic 3D space. This allows us to reason holistically about the spatial configuration of the scene in a domain where scale is consistent and distances between objects are meaningful. We apply this transformation as part of an end-to-end deep learning architecture and achieve state-of-the-art performance on the KITTI 3D object benchmark. [We will release full source code and pretrained models upon acceptance of this manuscript for publication.]
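
As a rough illustration of what mapping image-based features into an orthographic 3D space can involve, the sketch below looks up an image feature for each point of a metric 3D grid. It is not the authors' implementation, which the abstract does not describe. It assumes a pinhole camera with intrinsics `K` (scaled to the feature-map resolution), grid points given in the camera frame (x right, y down, z forward), and nearest-pixel sampling. The function name `lift_to_orthographic` and all of its parameters are hypothetical.

```python
import numpy as np

def lift_to_orthographic(feat, K, grid_points):
    """Sample image features at the projections of 3D grid points.

    feat:        (C, H, W) image feature map.
    K:           (3, 3) pinhole intrinsics at feature-map resolution (assumed).
    grid_points: (N, 3) grid-cell centres in camera coordinates.
    Returns an (N, C) array; rows stay zero for points that fall behind
    the camera or project outside the feature map.
    """
    C, H, W = feat.shape
    out = np.zeros((grid_points.shape[0], C), dtype=feat.dtype)

    # Perspective projection: [u*w, v*w, w]^T = K @ [x, y, z]^T
    uvw = grid_points @ K.T
    w = uvw[:, 2]
    in_front = w > 1e-6
    safe_w = np.where(in_front, w, 1.0)  # avoid dividing by zero or negatives

    # Nearest-pixel coordinates of each projected grid point
    u = np.round(uvw[:, 0] / safe_w).astype(int)
    v = np.round(uvw[:, 1] / safe_w).astype(int)

    valid = in_front & (u >= 0) & (u < W) & (v >= 0) & (v < H)
    # feat[:, v, u] has shape (C, M); transpose to (M, C) to fill the rows
    out[valid] = feat[:, v[valid], u[valid]].T
    return out
```

In this sketch, two grid points on the same camera ray at different depths read the same pixel, while the grid itself is metrically uniform, so spacing between cells does not depend on depth.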
