TransCAM: Transformer Attention-based CAM Refinement for Weakly Supervised Semantic Segmentation

03/14/2022
by   Ruiwen Li, et al.
0

Weakly supervised semantic segmentation (WSSS) with only image-level supervision is a challenging task. Most existing methods exploit Class Activation Maps (CAM) to generate pixel-level pseudo labels for supervised training. However, due to the local receptive field of Convolution Neural Networks (CNN), CAM applied to CNNs often suffers from partial activation – highlighting the most discriminative part instead of the entire object area. In order to capture both local features and global representations, the Conformer has been proposed to combine a visual transformer branch with a CNN branch. In this paper, we propose TransCAM, a Conformer-based solution to WSSS that explicitly leverages the attention weights from the transformer branch of the Conformer to refine the CAM generated from the CNN branch. TransCAM is motivated by our observation that attention weights from shallow transformer blocks are able to capture low-level spatial feature similarities while attention weights from deep transformer blocks capture high-level semantic context. Despite its simplicity, TransCAM achieves a new state-of-the-art performance of 69.3 test sets, showing the effectiveness of transformer attention-based refinement of CAM for WSSS.

READ FULL TEXT

page 1

page 4

page 5

page 8

page 13

page 14

research
09/30/2022

Dual Progressive Transformations for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation (WSSS), which aims to mine the o...
research
12/16/2021

Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation

Image-level weakly supervised semantic segmentation (WSSS) is a fundamen...
research
08/21/2023

CVFC: Attention-Based Cross-View Feature Consistency for Weakly Supervised Semantic Segmentation of Pathology Images

Histopathology image segmentation is the gold standard for diagnosing ca...
research
11/20/2022

Attention-based Class Activation Diffusion for Weakly-Supervised Semantic Segmentation

Extracting class activation maps (CAM) is a key step for weakly-supervis...
research
01/26/2023

Semantic Segmentation Enhanced Transformer Model for Human Attention Prediction

Saliency Prediction aims to predict the attention distribution of human ...
research
09/04/2023

Semantic-Constraint Matching Transformer for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) strives to learn to localiz...
research
02/16/2016

Deconvolutional Feature Stacking for Weakly-Supervised Semantic Segmentation

A weakly-supervised semantic segmentation framework with a tied deconvol...

Please sign up or login with your details

Forgot password? Click here to reset