HODINet: High-Order Discrepant Interaction Network for RGB-D Salient Object Detection

07/03/2023
by   Kang Yi, et al.
0

RGB-D salient object detection (SOD) aims to detect the prominent regions by jointly modeling RGB and depth information. Most RGB-D SOD methods apply the same type of backbones and fusion modules to identically learn the multimodality and multistage features. However, these features contribute differently to the final saliency results, which raises two issues: 1) how to model discrepant characteristics of RGB images and depth maps; 2) how to fuse these cross-modality features in different stages. In this paper, we propose a high-order discrepant interaction network (HODINet) for RGB-D SOD. Concretely, we first employ transformer-based and CNN-based architectures as backbones to encode RGB and depth features, respectively. Then, the high-order representations are delicately extracted and embedded into spatial and channel attentions for cross-modality feature fusion in different stages. Specifically, we design a high-order spatial fusion (HOSF) module and a high-order channel fusion (HOCF) module to fuse features of the first two and the last two stages, respectively. Besides, a cascaded pyramid reconstruction network is adopted to progressively decode the fused features in a top-down pathway. Extensive experiments are conducted on seven widely used datasets to demonstrate the effectiveness of the proposed approach. We achieve competitive performance against 24 state-of-the-art methods under four evaluation metrics.

READ FULL TEXT

page 1

page 4

page 9

page 10

page 11

research
07/14/2020

RGB-D Salient Object Detection with Cross-Modality Modulation and Selection

We present an effective method to progressively integrate and refine the...
research
10/12/2020

Learning Selective Mutual Attention and Contrast for RGB-D Saliency Detection

How to effectively fuse cross-modal information is the key problem for R...
research
10/12/2022

PSNet: Parallel Symmetric Network for Video Salient Object Detection

For the video salient object detection (VSOD) task, how to excavate the ...
research
02/18/2020

High-Order Paired-ASPP Networks for Semantic Segmenation

Current semantic segmentation models only exploit first-order statistics...
research
12/01/2020

A Unified Structure for Efficient RGB and RGB-D Salient Object Detection

Salient object detection (SOD) has been well studied in recent years, es...
research
06/07/2022

Dual Swin-Transformer based Mutual Interactive Network for RGB-D Salient Object Detection

Salient Object Detection is the task of predicting the human attended re...
research
10/10/2021

Modality-Guided Subnetwork for Salient Object Detection

Recent RGBD-based models for saliency detection have attracted research ...

Please sign up or login with your details

Forgot password? Click here to reset