AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

03/10/2021
by   Qi Song, et al.
0

Two factors have proven to be very important to the performance of semantic segmentation models: global context and multi-level semantics. However, generating features that capture both factors always leads to high computational complexity, which is problematic in real-time scenarios. In this paper, we propose a new model, called Attention-Augmented Network (AttaNet), to capture both global context and multilevel semantics while keeping the efficiency high. AttaNet consists of two primary modules: Strip Attention Module (SAM) and Attention Fusion Module (AFM). Viewing that in challenging images with low segmentation accuracy, there are a significantly larger amount of vertical strip areas than horizontal ones, SAM utilizes a striping operation to reduce the complexity of encoding global context in the vertical direction drastically while keeping most of contextual information, compared to the non-local approaches. Moreover, AFM follows a cross-level aggregation strategy to limit the computation, and adopts an attention strategy to weight the importance of different levels of features at each pixel when fusing them, obtaining an efficient multi-level representation. We have conducted extensive experiments on two semantic segmentation benchmarks, and our network achieves different levels of speed/accuracy trade-offs on Cityscapes, e.g., 71 FPS/79.9 mIoU, 130 FPS/78.5 ADE20K as well.

READ FULL TEXT

page 3

page 6

page 7

research
10/04/2022

ASAP: Accurate semantic segmentation for real time performance

Feature fusion modules from encoder and self-attention module have been ...
research
11/28/2022

Efficient Mirror Detection via Multi-level Heterogeneous Learning

We present HetNet (Multi-level Heterogeneous Network), a highly efficien...
research
08/25/2020

Dynamic deformable attention (DDANet) for semantic segmentation

Deep learning based medical image segmentation is an important step with...
research
10/19/2016

Mixed context networks for semantic segmentation

Semantic segmentation is challenging as it requires both object-level in...
research
07/02/2023

TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching

This study tackles the challenge of image matching in difficult scenario...
research
11/05/2019

Adaptive Context Network for Scene Parsing

Recent works attempt to improve scene parsing performance by exploring d...
research
09/15/2023

Efficient Polyp Segmentation Via Integrity Learning

Accurate polyp delineation in colonoscopy is crucial for assisting in di...

Please sign up or login with your details

Forgot password? Click here to reset