DeepAI AI Chat
Log In Sign Up

Efficient Visual Tracking with Exemplar Transformers

by   Philippe Blatter, et al.
ETH Zurich

The design of more complex and powerful neural network models has significantly advanced the state-of-the-art in visual object tracking. These advances can be attributed to deeper networks, or to the introduction of new building blocks, such as transformers. However, in the pursuit of increased tracking performance, efficient tracking architectures have received surprisingly little attention. In this paper, we introduce the Exemplar Transformer, an efficient transformer for real-time visual object tracking. E.T.Track, our visual tracker that incorporates Exemplar Transformer layers, runs at 47 fps on a CPU. This is up to 8 times faster than other transformer-based models, making it the only real-time transformer-based tracker. When compared to lightweight trackers that can operate in real-time on standard CPUs, E.T.Track consistently outperforms all other methods on the LaSOT, OTB-100, NFS, TrackingNet and VOT-ST2020 datasets. The code will soon be released on


page 1

page 2

page 3

page 4


Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations

Transformer networks have been a focus of research in many fields in rec...

Efficient Visual Tracking via Hierarchical Cross-Attention Transformer

In recent years, target tracking has made great progress in accuracy. Th...

ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization

The design of more complex and powerful neural network models has signif...

Divert More Attention to Vision-Language Tracking

Relying on Transformer for complex visual feature learning, object track...

Siamese Transformer Pyramid Networks for Real-Time UAV Tracking

Recent object tracking methods depend upon deep networks or convoluted a...

CoViT: Real-time phylogenetics for the SARS-CoV-2 pandemic using Vision Transformers

Real-time viral genome detection, taxonomic classification and phylogene...

TransCenter: Transformers with Dense Queries for Multiple-Object Tracking

Transformer networks have proven extremely powerful for a wide variety o...