Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer

11/10/2015
by   Jun Xie, et al.
0

Semantic annotations are vital for training models for object recognition, semantic segmentation or scene understanding. Unfortunately, pixelwise annotation of images at very large scale is labor-intensive and only little labeled data is available, particularly at instance level and for street scenes. In this paper, we propose to tackle this problem by lifting the semantic instance labeling task from 2D into 3D. Given reconstructions from stereo or laser data, we annotate static 3D scene elements with rough bounding primitives and develop a model which transfers this information into the image domain. We leverage our method to obtain 2D labels for a novel suburban video dataset which we have collected, resulting in 400k semantic and instance image annotations. A comparison of our method to state-of-the-art label transfer baselines reveals that 3D information enables more efficient annotation while at the same time resulting in improved accuracy and time-coherent labels.

READ FULL TEXT

page 2

page 5

page 8

research
03/29/2022

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Large-scale training data with high-quality annotations is critical for ...
research
04/06/2016

The Cityscapes Dataset for Semantic Urban Scene Understanding

Visual understanding of complex urban street scenes is an enabling facto...
research
09/19/2023

PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes

Training perception systems for self-driving cars requires substantial a...
research
09/28/2021

KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D

For the last few decades, several major subfields of artificial intellig...
research
06/18/2020

SceneAdapt: Scene-based domain adaptation for semantic segmentation using adversarial learning

Semantic segmentation methods have achieved outstanding performance than...
research
02/14/2022

COLA: COarse LAbel pre-training for 3D semantic segmentation of sparse LiDAR datasets

Transfer learning is a proven technique in 2D computer vision to leverag...
research
08/13/2018

3D Geometry-Aware Semantic Labeling of Outdoor Street Scenes

This paper is concerned with the problem of how to better exploit 3D geo...

Please sign up or login with your details

Forgot password? Click here to reset