Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

03/22/2021
by Ning Wang, et al.

In video object tracking, successive frames carry rich temporal context that existing trackers have largely overlooked. In this work, we bridge individual video frames and exploit the temporal context across them with a transformer architecture for robust object tracking. Unlike the classic use of the transformer in natural language processing, we separate its encoder and decoder into two parallel branches and carefully design them within Siamese-like tracking pipelines. The transformer encoder enhances the target templates through attention-based feature reinforcement, which benefits the generation of a high-quality tracking model. The transformer decoder propagates tracking cues from previous templates to the current frame, which facilitates the object search process. Our transformer-assisted tracking framework is neat and trained end-to-end. With the proposed transformer, a simple Siamese matching approach is able to outperform the current top-performing trackers. By combining our transformer with the recent discriminative tracking pipeline, our method sets several new state-of-the-art records on prevalent tracking benchmarks.
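To make the two-branch idea concrete, here is a minimal sketch of how an encoder branch might reinforce template features with self-attention and how a decoder branch might let search-frame features attend to those templates. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify learned projections, normalization, masking, or feature shapes, so all of these are omitted, and the function names (`encode_templates`, `decode_search`) and toy feature vectors are hypothetical.

```python
import math

def softmax(row):
    # Numerically stable softmax over a list of scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    # Plain scaled dot-product attention over lists of feature vectors.
    # Assumption: no learned projection matrices, single head.
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, values))
                    for d in range(len(values[0]))])
    return out

def residual(x, y):
    # Element-wise sum of two equally shaped lists of vectors.
    return [[a + b for a, b in zip(xr, yr)] for xr, yr in zip(x, y)]

def encode_templates(template_feats):
    # Encoder branch (hypothetical): templates attend to each other,
    # so features shared across past frames reinforce one another.
    attended = attention(template_feats, template_feats, template_feats)
    return residual(template_feats, attended)

def decode_search(search_feats, encoded_templates):
    # Decoder branch (hypothetical): current-frame features query the
    # encoded templates, pulling tracking cues into the search frame.
    attended = attention(search_feats, encoded_templates, encoded_templates)
    return residual(search_feats, attended)

# Toy data: three template feature vectors from earlier frames,
# two feature vectors from the current search frame.
templates = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
search = [[1.0, 0.0], [0.0, 1.0]]

encoded = encode_templates(templates)
decoded = decode_search(search, encoded)
```

In this sketch the two branches run on separate inputs and only meet at the cross-attention step, which mirrors the parallel encoder and decoder arrangement described above. A Siamese matching or discriminative tracking head would then consume `encoded` and `decoded`; that stage is not shown here.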
