Swin transformers make strong contextual encoders for VHR image road extraction

01/10/2022
by Tao Chen, et al.

Significant progress has been made in automatic road extraction (segmentation) based on deep learning, but there is still room for improvement in the completeness and connectivity of the results. This is mainly due to large intra-class variance, ambiguous inter-class distinctions, and occlusions from shadows, trees, and buildings. Perceiving global context and modeling geometric information is therefore essential to further improve the accuracy of road segmentation. In this paper, we design a novel dual-branch encoding block, CoSwin, which combines the global context modeling of the Swin Transformer with the local feature extraction of ResNet. We also propose a context-guided filter block, CFilter, which filters out context-independent noisy features so that details are reconstructed more faithfully. We use CoSwin and CFilter in a U-shaped network architecture. Experiments on the Massachusetts and CHN6-CUG datasets show that the proposed method outperforms other state-of-the-art methods on the F1, IoU, and OA metrics. Further analysis reveals that the improvement in accuracy comes from better integrity and connectivity of the segmented roads.
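
The abstract reports results in terms of F1, IoU, and OA (overall accuracy). As a reminder of what these measure for a binary road mask, here is a minimal sketch using the standard definitions; the function name `road_metrics` and the flat 0/1 label layout are assumptions made for illustration and are not taken from the paper's evaluation code.

```python
from typing import Dict, Sequence


def road_metrics(pred: Sequence[int], truth: Sequence[int]) -> Dict[str, float]:
    """Compute F1, IoU, and OA for flat sequences of 0/1 pixel labels.

    Hypothetical helper: 1 marks a road pixel, 0 marks background.
    """
    if len(pred) != len(truth):
        raise ValueError("pred and truth must have the same length")

    # Confusion-matrix counts for the road (positive) class.
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0

    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    # IoU is the overlap of predicted and true road pixels over their union.
    iou = tp / (tp + fp + fn) if (tp + fp + fn) else 0.0
    # OA is the fraction of all pixels labeled correctly.
    total = tp + fp + fn + tn
    oa = (tp + tn) / total if total else 0.0

    return {"f1": f1, "iou": iou, "oa": oa}


# Toy 8-pixel example: 2 true positives, 1 false positive,
# 1 false negative, 4 true negatives.
pred = [1, 1, 0, 0, 1, 0, 0, 0]
truth = [1, 0, 0, 0, 1, 1, 0, 0]
metrics = road_metrics(pred, truth)
```

Because road pixels are usually a small fraction of an image, OA can stay high even when many road pixels are missed, which is one reason F1 and IoU are typically reported alongside it.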

research
05/14/2020

Direction-aware Residual Network for Road Extraction in VHR Remote Sensing Images

The binary segmentation of roads in very high resolution (VHR) remote se...
research
04/25/2023

STM-UNet: An Efficient U-shaped Architecture Based on Swin Transformer and Multi-scale MLP for Medical Image Segmentation

Automated medical image segmentation can assist doctors to diagnose fast...
research
12/17/2021

Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction

Recent Transformer-based methods have achieved advanced performance in p...
research
02/18/2023

MultiScale Probability Map guided Index Pooling with Attention-based learning for Road and Building Segmentation

Efficient road and building footprint extraction from satellite images a...
research
07/08/2020

Designing and Training of A Dual CNN for Image Denoising

Deep convolutional neural networks (CNNs) for image denoising have recen...
research
08/28/2018

Iterative Deep Learning for Road Topology Extraction

This paper tackles the task of estimating the topology of road networks ...
research
09/18/2020

TopNet: Topology Preserving Metric Learning for Vessel Tree Reconstruction and Labelling

Reconstructing Portal Vein and Hepatic Vein trees from contrast enhanced...