M^3Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection

09/15/2023
by Yao Yuan, et al.

Most existing salient object detection methods use a U-Net or feature pyramid structure, which simply aggregates feature maps of different scales and ignores their uniqueness, their interdependence, and their respective contributions to the final prediction. To overcome these limitations, we propose M^3Net, the Multilevel, Mixed and Multistage attention network for Salient Object Detection (SOD). First, we propose the Multiscale Interaction Block, which introduces cross-attention to achieve interaction between multilevel features, allowing high-level features to guide low-level feature learning and thus enhance salient regions. Second, because previous Transformer-based SOD methods locate salient regions using only global self-attention and inevitably overlook the details of complex objects, we propose the Mixed Attention Block. This block combines global self-attention and window self-attention to model context at both global and local levels, further improving the accuracy of the prediction map. Finally, we propose a multilevel supervision strategy to optimize the aggregated features stage by stage. Experiments on six challenging datasets demonstrate that the proposed M^3Net surpasses recent CNN-based and Transformer-based state-of-the-art SOD methods on four metrics. Code is available at https://github.com/I2-Multimedia-Lab/M3Net.
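The two attention ideas named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation (see the linked repository for that). It assumes tokens are plain lists of floats, omits the learned query/key/value projections and multiple heads, uses 1-D windows over a token sequence instead of 2-D image windows, and averages the global and window outputs, which is a hypothetical fusion choice since the abstract does not say how the two branches are combined. All function names are illustrative.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention; every argument is a list of token vectors."""
    scale = math.sqrt(len(keys[0]))
    dim = len(values[0])
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        # Each output token is a weighted average of the value vectors.
        out.append([sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)])
    return out


def cross_attention(low_tokens, high_tokens):
    """Low-level tokens act as queries; high-level tokens supply keys and values,
    so each low-level position is described in terms of high-level content."""
    return attention(low_tokens, high_tokens, high_tokens)


def window_self_attention(tokens, window):
    """Self-attention restricted to consecutive, non-overlapping windows."""
    out = []
    for start in range(0, len(tokens), window):
        chunk = tokens[start:start + window]
        out.extend(attention(chunk, chunk, chunk))
    return out


def mixed_attention(tokens, window):
    """Assumed fusion: element-wise mean of the global and the window branch."""
    global_out = attention(tokens, tokens, tokens)
    local_out = window_self_attention(tokens, window)
    return [[(g + l) / 2 for g, l in zip(g_row, l_row)]
            for g_row, l_row in zip(global_out, local_out)]


# Toy usage: four low-level tokens guided by two high-level tokens.
low = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
high = [[2.0, 0.0], [0.0, 2.0]]
guided = cross_attention(low, high)     # 4 tokens, each of dimension 2
mixed = mixed_attention(low, window=2)  # global context blended with local context
```

In this sketch the window branch sees only tokens inside its own window, so it preserves local detail, while the global branch lets every token attend to the whole sequence. Setting `window` to the sequence length makes the two branches identical.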


