Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

10/07/2020
by   L. Koestler, et al.
0

The training of deep-learning-based 3D object detectors requires large datasets with 3D bounding box labels for supervision that have to be generated by hand-labeling. We propose a network architecture and training procedure for learning monocular 3D object detection without 3D bounding box labels. By representing the objects as triangular meshes and employing differentiable shape rendering, we define loss functions based on depth maps, segmentation masks, and ego- and object-motion, which are generated by pre-trained, off-the-shelf networks. We evaluate the proposed algorithm on the real-world KITTI dataset and achieve promising performance in comparison to state-of-the-art methods requiring 3D bounding box labels for training and superior performance to conventional baseline methods.

READ FULL TEXT

page 2

page 4

page 6

page 7

page 9

research
12/24/2013

Deep learning for class-generic object detection

We investigate the use of deep neural networks for the novel task of cla...
research
11/27/2018

Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects

Efficient and reliable methods for training of object detectors are in h...
research
07/23/2022

3D Labeling Tool

Training and testing supervised object detection models require a large ...
research
04/23/2019

Transferable Semi-supervised 3D Object Detection from RGB-D Data

We investigate the direction of training a 3D object detector for new ob...
research
11/02/2022

OPA-3D: Occlusion-Aware Pixel-Wise Aggregation for Monocular 3D Object Detection

Despite monocular 3D object detection having recently made a significant...
research
04/22/2019

Detecting retail products in situ using CNN without human effort labeling

CNN is a powerful tool for many computer vision tasks, achieving much be...

Please sign up or login with your details

Forgot password? Click here to reset