Building Facade Parsing R-CNN

05/12/2022
by Sijie Wang, et al.

Building facade parsing, which predicts pixel-level labels for building facades, has applications in computer vision perception for autonomous vehicle (AV) driving. However, because of camera perspective, the on-board camera of an AV captures a deformed view of the facades of buildings on both sides of the road, rather than a frontal view. We propose Facade R-CNN, which includes a transconv module, generalized bounding box detection, and convex regularization, to perform parsing of deformed facade views. Experiments demonstrate that Facade R-CNN achieves better performance than the current state-of-the-art facade parsing models, which are primarily developed for frontal views. We also publish a new building facade parsing dataset, which we call the Oxford RobotCar Facade dataset. It contains 500 street-view images from the Oxford RobotCar dataset, augmented with accurate annotations of building facade objects. The published dataset is available at https://github.com/sijieaaa/Oxford-RobotCar-Facade
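To make the perspective deformation concrete, the following is a minimal sketch, not the authors' method: it maps the corners of a frontal-view rectangular window through a projective transform and checks that the result is no longer an axis-aligned box. The matrix `H`, the corner coordinates, and both helper functions are hypothetical values and names chosen for illustration only.

```python
def apply_homography(H, points):
    """Map 2D points through a 3x3 projective transform H."""
    out = []
    for x, y in points:
        u = H[0][0] * x + H[0][1] * y + H[0][2]
        v = H[1][0] * x + H[1][1] * y + H[1][2]
        w = H[2][0] * x + H[2][1] * y + H[2][2]
        out.append((u / w, v / w))  # perspective divide
    return out


def is_axis_aligned_rectangle(corners, tol=1e-9):
    """True if corners (ordered TL, TR, BR, BL) form an axis-aligned box."""
    (x0, y0), (x1, y1), (x2, y2), (x3, y3) = corners
    return (abs(y0 - y1) < tol and abs(y2 - y3) < tol
            and abs(x0 - x3) < tol and abs(x1 - x2) < tol)


# Frontal view of a window: an axis-aligned rectangle (TL, TR, BR, BL).
window = [(0.0, 0.0), (2.0, 0.0), (2.0, 1.0), (0.0, 1.0)]

# Hypothetical homography (assumption): the non-zero entry in the last row
# shrinks points with larger x, as if the facade recedes from the camera.
H = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.25, 0.0, 1.0]]

deformed = apply_homography(H, window)

print(is_axis_aligned_rectangle(window))
print(is_axis_aligned_rectangle(deformed))
```

In this toy case the far edge of the window becomes shorter than the near edge, so the outline turns into a trapezoid that an axis-aligned bounding box can only enclose loosely. This is one way to see why parsing models developed for frontal views may fit such images poorly.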


