SwinNet: Swin Transformer drives edge-aware RGB-D and RGB-T salient object detection

04/12/2022
by   Zhengyi Liu, et al.
5

Convolutional neural networks (CNNs) are good at extracting contexture features within certain receptive fields, while transformers can model the global long-range dependency features. By absorbing the advantage of transformer and the merit of CNN, Swin Transformer shows strong feature representation ability. Based on it, we propose a cross-modality fusion model SwinNet for RGB-D and RGB-T salient object detection. It is driven by Swin Transformer to extract the hierarchical features, boosted by attention mechanism to bridge the gap between two modalities, and guided by edge information to sharp the contour of salient object. To be specific, two-stream Swin Transformer encoder first extracts multi-modality features, and then spatial alignment and channel re-calibration module is presented to optimize intra-level cross-modality features. To clarify the fuzzy boundary, edge-guided decoder achieves inter-level cross-modality fusion under the guidance of edge features. The proposed model outperforms the state-of-the-art models on RGB-D and RGB-T datasets, showing that it provides more insight into the cross-modality complementarity task.

READ FULL TEXT

page 1

page 3

page 7

page 8

page 9

page 10

research
01/08/2023

HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection

The High-Resolution Transformer (HRFormer) can maintain high-resolution ...
research
10/30/2021

Cross-Modality Fusion Transformer for Multispectral Object Detection

Multispectral image pairs can provide the combined information, making o...
research
08/17/2023

Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection

By integrating complementary information from RGB image and depth map, t...
research
09/21/2022

Position-Aware Relation Learning for RGB-Thermal Salient Object Detection

RGB-Thermal salient object detection (SOD) combines two spectra to segme...
research
03/21/2022

GroupTransNet: Group Transformer Network for RGB-D Salient Object Detection

Salient object detection on RGB-D images is an active topic in computer ...
research
07/04/2022

TANet: Transformer-based Asymmetric Network for RGB-D Salient Object Detection

Existing RGB-D SOD methods mainly rely on a symmetric two-stream CNN-bas...
research
10/10/2021

Modality-Guided Subnetwork for Salient Object Detection

Recent RGBD-based models for saliency detection have attracted research ...

Please sign up or login with your details

Forgot password? Click here to reset