SelfReformer: Self-Refined Network with Transformer for Salient Object Detection

05/23/2022
by Yi Ke Yun, et al.

Global and local context both contribute significantly to the integrity of predictions in Salient Object Detection (SOD). Unfortunately, existing methods still struggle to generate complete predictions with fine details. Conventional approaches have two major problems. First, for global context, high-level CNN-based encoder features cannot effectively capture long-range dependencies, resulting in incomplete predictions. Second, downsampling the ground truth to fit the size of the predictions introduces inaccuracy, because ground-truth details are lost during interpolation or pooling. In this work, we therefore develop a Transformer-based network and frame a supervised task for a branch to learn the global context information explicitly. We also adopt Pixel Shuffle from Super-Resolution (SR) to reshape the predictions back to the size of the ground truth instead of the reverse, so the details in the ground truth are left untouched. In addition, we develop a two-stage Context Refinement Module (CRM) to fuse the global context and to automatically locate and refine the local details in the predictions. Because the proposed network can guide and correct itself based on the global and local context it generates, we name it the Self-Refined Transformer (SelfReformer). Extensive experiments and evaluation results on five benchmark datasets demonstrate the outstanding performance of the network, which achieves state-of-the-art results.
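
As an illustration of the Pixel Shuffle step mentioned in the abstract, the following is a minimal pure-Python sketch of the standard depth-to-space rearrangement used in Super-Resolution. It is not the authors' implementation. The function name `pixel_shuffle`, the nested-list layout `(C*r*r, H, W)`, and the upscale factor `r` are assumptions made for this example. The idea is that a low-resolution prediction with `r*r` times as many channels is rearranged into a map `r` times larger in each spatial dimension, so it can be compared against the full-size ground truth without downsampling the latter.

```python
def pixel_shuffle(x, r):
    """Rearrange a nested list of shape (C*r*r, H, W) into (C, H*r, W*r).

    Illustrative sketch only: each group of r*r input channels supplies
    the r x r block of output pixels at every spatial location.
    """
    channels_in = len(x)
    if channels_in % (r * r) != 0:
        raise ValueError("channel count must be divisible by r*r")
    c_out = channels_in // (r * r)
    h, w = len(x[0]), len(x[0][0])

    # Allocate the upscaled output, filled with zeros.
    out = [[[0] * (w * r) for _ in range(h * r)] for _ in range(c_out)]

    for c in range(c_out):
        for i in range(r):          # row offset inside each r x r block
            for j in range(r):      # column offset inside each r x r block
                src = x[c * r * r + i * r + j]
                for row in range(h):
                    for col in range(w):
                        out[c][row * r + i][col * r + j] = src[row][col]
    return out


# Four 1x1 channels become a single 2x2 map.
low_res = [[[1]], [[2]], [[3]], [[4]]]
print(pixel_shuffle(low_res, 2))  # [[[1, 2], [3, 4]]]
```

In practice a deep learning framework's built-in operator would be used rather than explicit loops. The loops here only make the index mapping visible.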


