RGB-T Tracking Based on Mixed Attention

04/09/2023
by   Yang Luo, et al.
0

RGB-T tracking involves the use of images from both visible and thermal modalities. The primary objective is to adaptively leverage the relatively dominant modality in varying conditions to achieve more robust tracking compared to single-modality tracking. An RGB-T tracker based on mixed attention mechanism to achieve complementary fusion of modalities (referred to as MACFT) is proposed in this paper. In the feature extraction stage, we utilize different transformer backbone branches to extract specific and shared information from different modalities. By performing mixed attention operations in the backbone to enable information interaction and self-enhancement between the template and search images, it constructs a robust feature representation that better understands the high-level semantic features of the target. Then, in the feature fusion stage, a modality-adaptive fusion is achieved through a mixed attention-based modality fusion network, which suppresses the low-quality modality noise while enhancing the information of the dominant modality. Evaluation on multiple RGB-T public datasets demonstrates that our proposed tracker outperforms other RGB-T trackers on general evaluation metrics while also being able to adapt to longterm tracking scenarios.

READ FULL TEXT

page 2

page 4

page 6

page 11

research
07/17/2019

Multi-Adapter RGBT Tracking

The task of RGBT tracking aims to take the complementary advantages from...
research
08/30/2019

Multi-Modal Fusion for End-to-End RGB-T Tracking

We propose an end-to-end tracking framework for fusing the RGB and TIR m...
research
03/26/2023

RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning

Existing Transformer-based RGBT tracking methods either use cross-attent...
research
03/12/2021

Siamese Infrared and Visible Light Fusion Network for RGB-T Tracking

Due to the different photosensitive properties of infrared and visible l...
research
09/16/2021

Dynamic Fusion Network for RGBT Tracking

For both visible and infrared images have their own advantages and disad...
research
08/31/2023

RGB-T Tracking via Multi-Modal Mutual Prompt Learning

Object tracking based on the fusion of visible and thermal im-ages, know...
research
07/24/2019

Dense Feature Aggregation and Pruning for RGBT Tracking

How to perform effective information fusion of different modalities is a...

Please sign up or login with your details

Forgot password? Click here to reset