High-Performance Transformer Tracking

03/25/2022
by   Xin Chen, et al.
0

Correlation has a critical role in the tracking field, especially in recent popular Siamese-based trackers. The correlation operation is a simple fusion manner to consider the similarity between the template and the search region. However, the correlation operation is a local linear matching process, losing semantic information and falling into local optimum easily, which may be the bottleneck of designing high-accuracy tracking algorithms. In this work, to determine whether a better feature fusion method exists than correlation, a novel attention-based feature fusion network, inspired by Transformer, is presented. This network effectively combines the template and the search region features using attention. Specifically, the proposed method includes an ego-context augment module based on self-attention and a cross-feature augment module based on cross-attention. First, we present a Transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression head. Based on the TransT baseline, we further design a segmentation branch to generate an accurate mask. Finally, we propose a stronger version of TransT by extending TransT with a multi-template design and an IoU prediction head, named TransT-M. Experiments show that our TransT and TransT-M methods achieve promising results on seven popular datasets. Code and models are available at https://github.com/chenxin-dlut/TransT-M.

READ FULL TEXT

page 1

page 4

page 5

page 6

research
03/29/2021

Transformer Tracking

Correlation acts as a critical role in the tracking field, especially in...
research
05/09/2021

TrTr: Visual Tracking with Transformer

Template-based discriminative trackers are currently the dominant tracki...
research
08/18/2022

Learning Spatial-Frequency Transformer for Visual Object Tracking

Recent trackers adopt the Transformer to combine or replace the widely u...
research
03/18/2023

ABC: Attention with Bilinear Correlation for Infrared Small Target Detection

Infrared small target detection (ISTD) has a wide range of applications ...
research
01/22/2023

DASTSiam: Spatio-Temporal Fusion and Discriminative Augmentation for Improved Siamese Tracking

Tracking tasks based on deep neural networks have greatly improved with ...
research
12/06/2022

AbHE: All Attention-based Homography Estimation

Homography estimation is a basic computer vision task, which aims to obt...
research
05/25/2023

MixFormerV2: Efficient Fully Transformer Tracking

Transformer-based trackers have achieved strong accuracy on the standard...

Please sign up or login with your details

Forgot password? Click here to reset