
Holistic Multi-View Building Analysis in the Wild with Projection Pooling

by Zbigniew Wojna, et al.

We address six classification tasks on fine-grained building attributes: construction type, number of floors, roof pitch, roof geometry, facade material, and occupancy class. Remote building analysis of this kind became possible only recently, owing to the growth of large-scale datasets of urban scenes. To this end, we introduce a new benchmark dataset consisting of 49,426 top-view and street-view images of 9,674 buildings. These images are assembled together with their geometric metadata. The dataset showcases a variety of real-world challenges, such as occlusions, blur, partially visible objects, and a broad spectrum of buildings. We propose a new projection pooling layer that creates a unified top-view representation of the top view and the side views in a high-dimensional space. The layer lets us use the building and imagery metadata seamlessly. Introducing it improves classification accuracy compared to highly tuned baseline models, indicating its suitability for building analysis.
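The abstract does not spell out how projection pooling works, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes each street-view image has already been reduced to one non-negative feature vector per image column, that the camera position, heading, and field of view come from the imagery metadata, and that features are max-pooled along each viewing ray onto a top-view grid centred on the building. All function names, parameters, and the grid layout are hypothetical.

```python
import numpy as np

def project_side_view(col_feats, cam_xy, heading, fov,
                      grid_size=8, cell=1.0, max_range=3.0):
    """Max-pool per-column street-view features onto a top-view grid.

    col_feats: (W, C) array, one non-negative feature vector per image column.
    cam_xy:    camera position (x, y); the grid is centred on the origin.
    heading:   viewing direction in radians, counter-clockwise from the +x axis.
    fov:       horizontal field of view in radians.
    """
    W, C = col_feats.shape
    # Zeros act as the neutral element of max only for non-negative features.
    grid = np.zeros((grid_size, grid_size, C), dtype=col_feats.dtype)
    half = grid_size * cell / 2.0
    # Sample points along each ray at half-cell spacing (assumed resolution).
    dists = np.arange(cell / 2.0, max_range, cell / 2.0)
    for w in range(W):
        # Leftmost column looks towards heading + fov/2, rightmost towards heading - fov/2.
        angle = heading + fov * (0.5 - (w + 0.5) / W)
        xs = cam_xy[0] + dists * np.cos(angle)
        ys = cam_xy[1] + dists * np.sin(angle)
        ix = np.floor((xs + half) / cell).astype(int)
        iy = np.floor((ys + half) / cell).astype(int)
        inside = (ix >= 0) & (ix < grid_size) & (iy >= 0) & (iy < grid_size)
        for y, x in zip(iy[inside], ix[inside]):
            grid[y, x] = np.maximum(grid[y, x], col_feats[w])
    return grid

def fuse_with_top_view(top_feats, side_grids):
    """Max-pool all projected side views, then concatenate with top-view features."""
    pooled = np.max(np.stack(side_grids), axis=0)
    return np.concatenate([top_feats, pooled], axis=-1)
```

For example, a single-column view taken from `(-4, 0)` looking along `+x` writes its feature vector into the grid cells its ray crosses, and `fuse_with_top_view` then stacks the result onto a top-view feature map of the same spatial size.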



Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings

Street view images have been increasingly used in tasks like urban land ...

OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images

This paper presents OmniCity, a new dataset for omnipotent city understa...

TMBuD: A dataset for urban scene building detection

Building recognition and 3D reconstruction of human made structures in u...

Building Information Modeling and Classification by Visual Learning At A City Scale

In this paper, we provide two case studies to demonstrate how artificial...

SpaceNet MVOI: a Multi-View Overhead Imagery Dataset

Detection and segmentation of objects in overhead imagery is a challeng...

Privacy Protection in Street-View Panoramas using Depth and Multi-View Imagery

The current paradigm in privacy protection in street-view images is to d...

Automated Building Image Extraction from 360-degree Panoramas for Post-Disaster Evaluation

After a disaster, teams of structural engineers collect vast amounts of ...