GroupTransNet: Group Transformer Network for RGB-D Salient Object Detection

03/21/2022
by   Xian Fang, et al.
0

Salient object detection on RGB-D images is an active topic in computer vision. Although the existing methods have achieved appreciable performance, there are still some challenges. The locality of convolutional neural network requires that the model has a sufficiently deep global receptive field, which always leads to the loss of local details. To address the challenge, we propose a novel Group Transformer Network (GroupTransNet) for RGB-D salient object detection. This method is good at learning the long-range dependencies of cross layer features to promote more perfect feature expression. At the beginning, the features of the slightly higher classes of the middle three levels and the latter three levels are soft grouped to absorb the advantages of the high-level features. The input features are repeatedly purified and enhanced by the attention mechanism to purify the cross modal features of color modal and depth modal. The features of the intermediate process are first fused by the features of different layers, and then processed by several transformers in multiple groups, which not only makes the size of the features of each scale unified and interrelated, but also achieves the effect of sharing the weight of the features within the group. The output features in different groups complete the clustering staggered by two owing to the level difference, and combine with the low-level features. Extensive experiments demonstrate that GroupTransNet outperforms the comparison models and achieves the new state-of-the-art performance.

READ FULL TEXT

page 1

page 3

page 7

page 9

research
08/09/2021

TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network

Salient object detection is the pixel-level dense prediction task which ...
research
05/10/2017

Learning RGB-D Salient Object Detection using background enclosure, depth contrast, and top-down features

Recently, deep Convolutional Neural Networks (CNN) have demonstrated str...
research
04/12/2022

SwinNet: Swin Transformer drives edge-aware RGB-D and RGB-T salient object detection

Convolutional neural networks (CNNs) are good at extracting contexture f...
research
09/15/2023

Salient Object Detection in Optical Remote Sensing Images Driven by Transformer

Existing methods for Salient Object Detection in Optical Remote Sensing ...
research
04/28/2021

Learning Synergistic Attention for Light Field Salient Object Detection

We propose a novel Synergistic Attention Network (SA-Net) to address the...
research
09/10/2022

Large-Field Contextual Feature Learning for Glass Detection

Glass is very common in our daily life. Existing computer vision systems...
research
03/07/2021

GANav: Group-wise Attention Network for Classifying Navigable Regions in Unstructured Outdoor Environments

We present a new learning-based method for identifying safe and navigabl...

Please sign up or login with your details

Forgot password? Click here to reset