Leveraging Pre-Trained 3D Object Detection Models For Fast Ground Truth Generation

07/16/2018
by Jungwook Lee, et al.

Training 3D object detectors for autonomous driving has been limited to small datasets because of the effort required to generate annotations. Reducing both task complexity and the amount of task switching done by annotators is key to reducing the effort and time required to generate 3D bounding box annotations. This paper introduces a novel ground truth generation method that combines human supervision with pretrained neural networks to generate per-instance 3D point cloud segmentation, 3D bounding boxes, and class annotations. The annotators provide object anchor clicks, which act as seeds for generating instance segmentation results in 3D. The points belonging to each instance are then used to regress object centroids, bounding box dimensions, and object orientation. Our proposed annotation scheme reduces human annotation time by a factor of 30. We use the KITTI 3D object detection dataset to evaluate the efficiency and quality of our annotation scheme. We also test the proposed scheme on previously unseen data from the Autonomoose self-driving vehicle to demonstrate the generalization capabilities of the network.
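
To make the click-to-box flow described above concrete, here is a minimal geometric sketch in Python. It is not the paper's method: the abstract describes pretrained neural networks for both the instance segmentation and the box regression, whereas this sketch substitutes a fixed-radius selection around the click for the segmentation and a 2D principal-axis fit for the regression. The names `select_seed_points`, `fit_box`, `Box3D`, and the `radius` parameter are hypothetical and chosen only for illustration.

```python
import math
from typing import List, NamedTuple, Tuple

Point = Tuple[float, float, float]  # (x, y, z) in metres, z pointing up


class Box3D(NamedTuple):
    centroid: Point
    length: float  # extent along the principal ground-plane axis
    width: float   # extent perpendicular to that axis
    height: float  # extent along z
    yaw: float     # principal-axis angle about z, in radians


def select_seed_points(points: List[Point], click_xy: Tuple[float, float],
                       radius: float) -> List[Point]:
    """Stand-in for instance segmentation: keep points whose ground-plane
    distance to the annotator's click is at most `radius` (an assumption)."""
    cx, cy = click_xy
    return [p for p in points if math.hypot(p[0] - cx, p[1] - cy) <= radius]


def fit_box(points: List[Point]) -> Box3D:
    """Stand-in for box regression: centroid as the mean of the points,
    yaw from the 2D principal axis, dimensions from projected extents."""
    n = len(points)
    if n == 0:
        raise ValueError("no points selected for this instance")
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    mz = sum(p[2] for p in points) / n
    # Ground-plane covariance terms.
    sxx = sum((p[0] - mx) ** 2 for p in points) / n
    syy = sum((p[1] - my) ** 2 for p in points) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / n
    # Orientation of the principal axis of a 2x2 covariance matrix.
    yaw = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    c, s = math.cos(yaw), math.sin(yaw)
    along = [(p[0] - mx) * c + (p[1] - my) * s for p in points]
    across = [-(p[0] - mx) * s + (p[1] - my) * c for p in points]
    zs = [p[2] for p in points]
    return Box3D((mx, my, mz), max(along) - min(along),
                 max(across) - min(across), max(zs) - min(zs), yaw)


# Toy cloud: four points on a 4 m x 2 m object plus one distant outlier.
cloud = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0), (4.0, 2.0, 1.5),
         (0.0, 2.0, 1.5), (20.0, 20.0, 0.0)]
selected = select_seed_points(cloud, click_xy=(2.0, 1.0), radius=3.0)
box = fit_box(selected)
```

A purely geometric fit like this depends heavily on how completely the object was observed, which is one plausible reason to regress box parameters with a learned model instead, as the abstract describes.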


research
11/18/2021

Towards Open Vocabulary Object Detection without Human-provided Bounding Boxes

Despite great progress in object detection, most existing methods are li...
research
09/18/2023

Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object Detection with Repeated Labels

The reliability of supervised machine learning systems depends on the ac...
research
03/24/2021

TagMe: GPS-Assisted Automatic Object Annotation in Videos

Training high-accuracy object detection models requires large and divers...
research
08/09/2017

Extreme clicking for efficient object annotation

Manually annotating object bounding boxes is central to building compute...
research
07/11/2016

Benchmark for License Plate Character Segmentation

Automatic License Plate Recognition (ALPR) has been the focus of many re...
research
12/11/2013

Associative embeddings for large-scale knowledge transfer with self-assessment

We propose a method for knowledge transfer between semantically related ...
research
12/08/2020

A Dataset and Application for Facial Recognition of Individual Gorillas in Zoo Environments

We put forward a video dataset with 5k+ facial bounding box annotations ...
