Holistic Multi-View Building Analysis in the Wild with Projection Pooling

08/23/2020
by   Zbigniew Wojna, et al.
5

We address six different classification tasks related to fine-grained building attributes: construction type, number of floors, pitch, and geometry of the roof, facade material, and occupancy class. Tackling such a problem of remote building analysis became possible only recently due to growing large-scale datasets of urban scenes. To this end, we introduce a new benchmarking dataset, consisting of 49426 top-view and street-view images of 9674 buildings. These photos are further assembled, together with the geometric metadata. The dataset showcases a variety of real-world challenges, such as occlusions, blur, partially visible objects, and a broad spectrum of buildings. We propose a new projection pooling layer, creating a unified, top-view representation of the top-view and the side views in a high-dimensional space. It allows us to utilize the building and imagery metadata seamlessly. Introducing this layer improves classification accuracy – compared to highly tuned baseline models – indicating its suitability for building analysis.

READ FULL TEXT

page 2

page 9

page 19

page 20

page 21

page 22

page 23

page 25

research
05/04/2023

UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation

We present the UrbanBIS benchmark for large-scale 3D urban understanding...
research
10/03/2020

Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings

Street view images have been increasingly used in tasks like urban land ...
research
08/01/2022

OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images

This paper presents OmniCity, a new dataset for omnipotent city understa...
research
10/14/2019

Building Information Modeling and Classification by Visual Learning At A City Scale

In this paper, we provide two case studies to demonstrate how artificial...
research
09/14/2023

Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved

Crowdsourced platforms provide huge amounts of street-view images that c...
research
05/04/2019

Automated Building Image Extraction from 360-degree Panoramas for Post-Disaster Evaluation

After a disaster, teams of structural engineers collect vast amounts of ...
research
10/27/2018

3D Terrain Segmentation in the SWIR Spectrum

We focus on the automatic 3D terrain segmentation problem using hyperspe...

Please sign up or login with your details

Forgot password? Click here to reset