Learning Target-oriented Dual Attention for Robust RGB-T Tracking

08/12/2019
by   Rui Yang, et al.
2

RGB-Thermal object tracking attempt to locate target object using complementary visual and thermal infrared data. Existing RGB-T trackers fuse different modalities by robust feature representation learning or adaptive modal weighting. However, how to integrate dual attention mechanism for visual tracking is still a subject that has not been studied yet. In this paper, we propose two visual attention mechanisms for robust RGB-T object tracking. Specifically, the local attention is implemented by exploiting the common visual attention of RGB and thermal data to train deep classifiers. We also introduce the global attention, which is a multi-modal target-driven attention estimation network. It can provide global proposals for the classifier together with local proposals extracted from previous tracking result. Extensive experiments on two RGB-T benchmark datasets validated the effectiveness of our proposed algorithm.

READ FULL TEXT

page 4

page 7

research
01/23/2022

Visual Object Tracking on Multi-modal RGB-D Videos: A Review

The development of visual object tracking has continued for decades. Rec...
research
11/25/2018

Describe and Attend to Track: Learning Natural Language guided Structural Representation and Visual Attention for Object Tracking

The tracking-by-detection framework requires a set of positive and negat...
research
10/09/2018

Deep Attentive Tracking via Reciprocative Learning

Visual attention, derived from cognitive neuroscience, facilitates human...
research
01/22/2022

Temporal Aggregation for Adaptive RGBT Tracking

Visual object tracking with RGB and thermal infrared (TIR) spectra avail...
research
01/21/2022

Exploring Fusion Strategies for Accurate RGBT Visual Object Tracking

We address the problem of multi-modal object tracking in video and explo...
research
08/10/2021

Multi-domain Collaborative Feature Representation for Robust Visual Object Tracking

Jointly exploiting multiple different yet complementary domain informati...
research
11/08/2021

Cross-Modal Object Tracking: Modality-Aware Representations and A Unified Benchmark

In many visual systems, visual tracking often bases on RGB image sequenc...

Please sign up or login with your details

Forgot password? Click here to reset