TrTr: Visual Tracking with Transformer

05/09/2021
by Moju Zhao, et al.

Template-based discriminative trackers are currently the dominant tracking methods owing to their robustness and accuracy, and Siamese-network-based methods, which rely on a cross-correlation operation between features extracted from the template and search images, achieve state-of-the-art tracking performance. However, a general cross-correlation operation can only capture relationships between local patches in the two feature maps. In this paper, we propose a novel tracker network based on a powerful attention mechanism, the Transformer encoder-decoder architecture, to capture global and rich contextual interdependencies. In this new architecture, features of the template image are processed by a self-attention module in the encoder to learn strong context information, which is then sent to the decoder to compute cross-attention with the search image features, themselves processed by another self-attention module. In addition, we design classification and regression heads that use the output of the Transformer to localize the target based on a shape-agnostic anchor. We extensively evaluate our tracker, TrTr, on the VOT2018, VOT2019, OTB-100, UAV, NfS, TrackingNet, and LaSOT benchmarks, and our method performs favorably against state-of-the-art algorithms. Training code and pretrained models are available at https://github.com/tongtybj/TrTr.
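The data flow described in the abstract (self-attention over template features, self-attention over search features, then cross-attention between the two) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses plain scaled dot-product attention with no learned projections, multiple heads, positional encodings, residual connections, or feed-forward layers. The function names `attention` and `fuse_template_and_search` are hypothetical, and taking queries from the search features and keys/values from the template features is an assumption that follows the standard Transformer decoder convention.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention; every query attends over ALL keys,
    which is what gives global (not just local-patch) context."""
    dim = len(keys[0])
    outputs = []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        weights = softmax(scores)
        # Weighted sum of value vectors, one output vector per query.
        outputs.append(
            [
                sum(w * v[j] for w, v in zip(weights, values))
                for j in range(len(values[0]))
            ]
        )
    return outputs


def fuse_template_and_search(template_tokens, search_tokens):
    """Hypothetical helper mirroring the encoder-decoder flow in the abstract."""
    # Encoder side: self-attention over the template features.
    memory = attention(template_tokens, template_tokens, template_tokens)
    # Decoder side: self-attention over the search features.
    search_ctx = attention(search_tokens, search_tokens, search_tokens)
    # Cross-attention (assumed roles): search queries, template keys/values.
    return attention(search_ctx, memory, memory)


if __name__ == "__main__":
    # Toy flattened feature maps: 2 template tokens, 3 search tokens, dim 2.
    template = [[1.0, 0.0], [0.0, 1.0]]
    search = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    for row in fuse_template_and_search(template, search):
        print(row)
```

The result has one vector per search token. In the tracker described above, such per-location outputs would feed the classification and regression heads; those heads are omitted here.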


