Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention

07/05/2022
by Gary Leung, et al.

Existing transformer-based image backbones typically propagate feature information in one direction, from lower to higher levels. This may not be ideal: the localization ability needed to delineate accurate object boundaries is most prominent in the lower, high-resolution feature maps, while the semantics that can disambiguate image signals belonging to one object vs. another typically emerge at higher levels of processing. We present Hierarchical Inter-Level Attention (HILA), an attention-based method that captures bottom-up and top-down updates between features of different levels. HILA extends hierarchical vision transformer architectures by adding local connections between features of higher and lower levels to the backbone encoder. In each iteration, we construct a hierarchy by having higher-level features compete for assignments to update the lower-level features belonging to them, iteratively resolving object-part relationships. These improved lower-level features are then used to re-update the higher-level features. HILA can be integrated into the majority of hierarchical architectures without requiring any changes to the base model. We add HILA to SegFormer and the Swin Transformer and show notable improvements in semantic segmentation accuracy with fewer parameters and FLOPs. Project website and code: https://www.cs.toronto.edu/~garyleung/hila/
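
The alternating top-down and bottom-up updates described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes plain dot-product cross-attention with no learned projections, global rather than local connections, residual updates, and a softmax taken over the higher-level features so that they compete for each lower-level feature. The names `cross_attention` and `inter_level_update` are hypothetical.

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys_values):
    # Scaled dot-product attention without learned projections (assumption).
    # The softmax runs over the key axis, so keys compete for each query.
    d = queries.shape[-1]
    scores = queries @ keys_values.T / np.sqrt(d)
    weights = softmax(scores, axis=-1)
    return weights @ keys_values, weights

def inter_level_update(lower, higher, iterations=2):
    # lower:  (N_low, d) high-resolution features
    # higher: (N_high, d) coarse, more semantic features
    for _ in range(iterations):
        # Top-down: higher-level features compete to update each lower-level feature.
        top_down, _ = cross_attention(lower, higher)
        lower = lower + top_down
        # Bottom-up: the refined lower-level features re-update the higher level.
        bottom_up, _ = cross_attention(higher, lower)
        higher = higher + bottom_up
    return lower, higher

rng = np.random.default_rng(0)
lower = rng.standard_normal((64, 32))    # e.g. an 8x8 grid of 32-d features
higher = rng.standard_normal((16, 32))   # e.g. a 4x4 grid of 32-d features
new_lower, new_higher = inter_level_update(lower, higher)
```

Both outputs keep the shapes of their inputs, so a block like this could in principle sit between two stages of a hierarchical encoder. A version closer to the abstract would restrict each lower-level feature to a local neighborhood of higher-level features instead of attending globally.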

research
12/18/2020

A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection

Both high-level and high-resolution feature representations are of great...
research
03/15/2023

HFGD: High-level Feature Guided Decoder for Semantic Segmentation

Commonly used backbones for semantic segmentation, such as ResNet and Sw...
research
05/10/2022

STDC-MA Network for Semantic Segmentation

Semantic segmentation is applied extensively in autonomous driving and i...
research
04/03/2019

GFF: Gated Fully Fusion for Semantic Segmentation

Semantic segmentation generates comprehensive understanding of scenes at...
research
10/15/2022

Prediction Calibration for Generalized Few-shot Semantic Segmentation

Generalized Few-shot Semantic Segmentation (GFSS) aims to segment each i...
research
02/07/2020

iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention

In this report, the method for the iqiyi submission to the task of Activ...
research
06/05/2017

Inconsistent Node Flattening for Improving Top-down Hierarchical Classification

Large-scale classification of data where classes are structurally organi...
