S^2-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic Segmentation

06/15/2022
by   Mohammed A. M. Elhassan, et al.
0

Modern high-performance semantic segmentation methods employ a heavy backbone and dilated convolution to extract the relevant feature. Although extracting features with both contextual and semantic information is critical for the segmentation tasks, it brings a memory footprint and high computation cost for real-time applications. This paper presents a new model to achieve a trade-off between accuracy/speed for real-time road scene semantic segmentation. Specifically, we proposed a lightweight model named Scale-aware Strip Attention Guided Feature Pyramid Network (S^2-FPN). Our network consists of three main modules: Attention Pyramid Fusion (APF) module, Scale-aware Strip Attention Module (SSAM), and Global Feature Upsample (GFU) module. APF adopts an attention mechanisms to learn discriminative multi-scale features and help close the semantic gap between different levels. APF uses the scale-aware attention to encode global context with vertical stripping operation and models the long-range dependencies, which helps relate pixels with similar semantic label. In addition, APF employs channel-wise reweighting block (CRB) to emphasize the channel features. Finally, the decoder of S^2-FPN then adopts GFU, which is used to fuse features from APF and the encoder. Extensive experiments have been conducted on two challenging semantic segmentation benchmarks, which demonstrate that our approach achieves better accuracy/speed trade-off with different model settings. The proposed models have achieved a results of 76.2%mIoU/87.3FPS, 77.4%mIoU/67FPS, and 77.8%mIoU/30.5FPS on Cityscapes dataset, and 69.6%mIoU,71.0% mIoU, and 74.2% mIoU on Camvid dataset. The code for this work will be made available at <https://github.com/mohamedac29/S2-FPN>

READ FULL TEXT

page 3

page 8

page 9

research
04/06/2022

PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

Real-world applications have high demands for semantic segmentation meth...
research
06/04/2023

Cross-CBAM: A Lightweight network for Scene Segmentation

Scene parsing is a great challenge for real-time semantic segmentation. ...
research
02/16/2021

Feature Pyramid Network with Multi-Head Attention for Semantic Segmentation of Fine-Resolution Remotely Sensed Images

Semantic segmentation from fine-resolution remotely sensed images is an ...
research
12/24/2021

Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

This paper presents a novel pothole detection approach based on single-m...
research
11/05/2021

AGPCNet: Attention-Guided Pyramid Context Networks for Infrared Small Target Detection

Infrared small target detection is an important problem in many fields s...
research
10/04/2022

ASAP: Accurate semantic segmentation for real time performance

Feature fusion modules from encoder and self-attention module have been ...
research
12/14/2021

PP-HumanSeg: Connectivity-Aware Portrait Segmentation with a Large-Scale Teleconferencing Video Dataset

As the COVID-19 pandemic rampages across the world, the demands of video...

Please sign up or login with your details

Forgot password? Click here to reset