PSNet: Parallel Symmetric Network for Video Salient Object Detection

10/12/2022
by Runmin Cong, et al.

For the video salient object detection (VSOD) task, how to exploit information from the appearance modality and the motion modality has long been a central concern. The two-stream structure, consisting of an RGB appearance stream and an optical flow motion stream, is widely used as a typical pipeline for VSOD, but existing methods usually either use motion features to guide appearance features in one direction only, or fuse the features of the two modalities adaptively but blindly. These methods underperform in diverse scenarios because their learning schemes are neither comprehensive nor specific. In this paper, following a more secure modeling philosophy, we investigate the importance of the appearance modality and the motion modality more comprehensively and propose a VSOD network with up-and-down parallel symmetry, named PSNet. Two parallel branches with different dominant modalities achieve complete video saliency decoding through the cooperation of the Gather Diffusion Reinforcement (GDR) module and the Cross-modality Refinement and Complement (CRC) module. Finally, the Importance Perception Fusion (IPF) module fuses the features from the two parallel branches according to their different importance in different scenarios. Experiments on four benchmark datasets demonstrate that our method achieves competitive performance.
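
The abstract does not say how the IPF module computes or applies importance, so the following is only a minimal sketch of the general idea of importance-weighted fusion, not the paper's method. It assumes each branch yields a flat feature vector and a single scalar importance score, and that the scores are normalized with a softmax; the names `importance_weights` and `fuse_branches` are hypothetical.

```python
import math


def importance_weights(score_a, score_b):
    """Turn two scalar importance scores into weights that sum to 1 (softmax).

    Assumption: importance is one scalar per branch; the paper may differ.
    """
    # Subtract the maximum for numerical stability before exponentiating.
    m = max(score_a, score_b)
    exp_a = math.exp(score_a - m)
    exp_b = math.exp(score_b - m)
    total = exp_a + exp_b
    return exp_a / total, exp_b / total


def fuse_branches(features_a, features_b, score_a, score_b):
    """Weighted sum of two equally sized feature vectors.

    features_a: features from the appearance-dominated branch (hypothetical)
    features_b: features from the motion-dominated branch (hypothetical)
    """
    if len(features_a) != len(features_b):
        raise ValueError("both branches must produce features of equal length")
    w_a, w_b = importance_weights(score_a, score_b)
    return [w_a * a + w_b * b for a, b in zip(features_a, features_b)]
```

With equal scores the two branches contribute equally; as one score grows relative to the other, the fused output approaches that branch's features. A learned module would predict the scores from the input rather than take them as arguments.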


research
02/12/2022

Depth-Cooperated Trimodal Network for Video Salient Object Detection

Depth can provide useful geographical cues for salient object detection ...
research
07/03/2023

HODINet: High-Order Discrepant Interaction Network for RGB-D Salient Object Detection

RGB-D salient object detection (SOD) aims to detect the prominent region...
research
08/01/2019

Two-Stream Video Classification with Cross-Modality Attention

Fusing multi-modality information is known to be able to effectively bri...
research
10/06/2022

CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object Detection

Focusing on the issue of how to effectively capture and utilize cross-mo...
research
10/09/2022

Does Thermal Really Always Matter for RGB-T Salient Object Detection?

In recent years, RGB-T salient object detection (SOD) has attracted cont...
research
07/13/2022

Appearance-guided Attentive Self-Paced Learning for Unsupervised Salient Object Detection

Existing Deep-Learning-based (DL-based) Unsupervised Salient Object Dete...
research
07/26/2020

Challenge-Aware RGBT Tracking

RGB and thermal source data suffer from both shared and specific challen...
