Multi-interactive Encoder-decoder Network for RGBT Salient Object Detection

05/05/2020
by   Zhengzheng Tu, et al.
0

RGBT salient object detection (SOD) aims to segment the common prominent regions by exploring and exploiting the complementary information of visible and thermal infrared images. However, existing methods simply integrate features of these two modalities, and thus could not explore the potentials of their complementarity. In this paper, we propose a novel multi-interactive encoder-decoder network to achieve an elaborative fusion for RGBT SOD. Our network relies on an encoder-decoder for the feature extraction and fusion, and we design a multi-interaction block (MIB) to model the interactions of different modalities, different layers and local-global information. In particular, we interact and integrate the multi-level features of different modalities in a two-stream decoder, which could fuse modal information sufficiently while maintaining their own specific feature representations for more robust detection performance. Moreover, each MIB block accepts both information from previous MIB and global context to restore more spatial details and object semantics respectively. Extensive experiments on the existing RGBT SOD datasets show that the proposed method achieves outstanding performance against the state-of-the-art algorithms.

READ FULL TEXT

page 3

page 11

research
10/28/2022

PSFormer: Point Transformer for 3D Salient Object Detection

We propose PSFormer, an effective point transformer model for 3D salient...
research
04/29/2021

Video Salient Object Detection via Adaptive Local-Global Refinement

Video salient object detection (VSOD) is an important task in many visio...
research
12/23/2022

Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image

Salient object detection (SOD) aims to determine the most visually attra...
research
12/14/2021

TRACER: Extreme Attention Guided Salient Object Tracing Network

Existing studies on salient object detection (SOD) focus on extracting d...
research
01/19/2022

TransFuse: A Unified Transformer-based Image Fusion Framework using Self-supervised Learning

Image fusion is a technique to integrate information from multiple sourc...
research
02/20/2020

Stroke Constrained Attention Network for Online Handwritten Mathematical Expression Recognition

In this paper, we propose a novel stroke constrained attention network (...
research
09/08/2017

Learning to Segment Breast Biopsy Whole Slide Images

We trained and applied an encoder-decoder model to semantically segment ...

Please sign up or login with your details

Forgot password? Click here to reset