PanorAMS: Automatic Annotation for Detecting Objects in Urban Context

08/30/2022
by Inske Groenen, et al.

Large collections of geo-referenced panoramic images are freely available for cities across the globe, as are detailed maps with location and metadata on a great variety of urban objects. Together they provide a potentially rich source of information on urban objects, but manual annotation for object detection is costly, laborious and difficult. Can we utilize such multimedia sources to automatically annotate street-level images as an inexpensive alternative to manual labeling? With the PanorAMS framework, we introduce a method to automatically generate bounding box annotations for panoramic images based on urban context information. Following this method, we acquire large-scale, albeit noisy, annotations for an urban dataset solely from open data sources in a fast and automatic manner. The dataset covers the City of Amsterdam and includes over 14 million noisy bounding box annotations of 22 object categories present in 771,299 panoramic images. For many objects, further fine-grained information is available, obtained from geospatial metadata, such as building value, function and average surface area. Such information would have been difficult, if not impossible, to acquire via manual labeling based on the image alone. For detailed evaluation, we introduce an efficient crowdsourcing protocol for bounding box annotations in panoramic images, which we deploy to acquire 147,075 ground-truth object annotations for a subset of 7,348 images, the PanorAMS-clean dataset. For our PanorAMS-noisy dataset, we provide an extensive analysis of the noise and of how different types of noise affect image classification and object detection performance. We make both datasets, PanorAMS-noisy and PanorAMS-clean, as well as the benchmarks and tools presented in this paper, openly available.
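The abstract does not spell out the geometry behind the automatic annotations, so the following is only a minimal sketch of one ingredient such a pipeline might use: mapping a geo-located map object to a horizontal pixel position in a panorama. It assumes an equirectangular panorama whose center column faces the camera heading, and a known camera position and heading. The function names `bearing_deg` and `bearing_to_column` and all parameters are hypothetical and are not taken from the PanorAMS tools.

```python
import math


def bearing_deg(cam_lat, cam_lon, obj_lat, obj_lon):
    """Initial great-circle bearing from camera to object,
    in degrees clockwise from north (0 <= bearing < 360)."""
    phi1 = math.radians(cam_lat)
    phi2 = math.radians(obj_lat)
    dlon = math.radians(obj_lon - cam_lon)
    y = math.sin(dlon) * math.cos(phi2)
    x = (math.cos(phi1) * math.sin(phi2)
         - math.sin(phi1) * math.cos(phi2) * math.cos(dlon))
    return math.degrees(math.atan2(y, x)) % 360.0


def bearing_to_column(bearing, cam_heading, image_width):
    """Map a bearing to a pixel column of an equirectangular panorama.

    Assumption: the image spans 360 degrees horizontally and its
    center column points in the direction of cam_heading.
    """
    # Angle relative to the left image edge, wrapped into [0, 360).
    rel = (bearing - cam_heading + 180.0) % 360.0
    return rel / 360.0 * image_width


if __name__ == "__main__":
    # Hypothetical camera and object coordinates, for illustration only.
    b = bearing_deg(52.37, 4.89, 52.38, 4.89)
    print(bearing_to_column(b, cam_heading=0.0, image_width=8000))
```

Projecting the two horizontal extremes of an object's map footprint this way would give the left and right edges of a candidate bounding box. The vertical extent would additionally need the object's height and distance, which this sketch leaves out.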


research
03/03/2020

Towards Noise-resistant Object Detection with Noisy Annotations

Training deep object detectors requires significant amount of human-anno...
research
09/11/2023

Gall Bladder Cancer Detection from US Images with Only Image Level Labels

Automated detection of Gallbladder Cancer (GBC) from Ultrasound (US) ima...
research
05/05/2019

Understanding urban landuse from the above and ground perspectives: a deep learning, multimodal solution

Landuse characterization is important for urban planning. It is traditio...
research
05/01/2019

3D BAT: A Semi-Automatic, Web-based 3D Annotation Toolbox for Full-Surround, Multi-Modal Data Streams

In this paper, we focus on obtaining 2D and 3D labels, as well as track ...
research
06/09/2023

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

Recently, there has been a growing interest in research concerning docum...
research
12/11/2013

Associative embeddings for large-scale knowledge transfer with self-assessment

We propose a method for knowledge transfer between semantically related ...
research
09/11/2023

CitDet: A Benchmark Dataset for Citrus Fruit Detection

In this letter, we present a new dataset to advance the state of the art...
