Graph Attention Tracking

by   Dongyan Guo, et al.

Siamese network based trackers formulate the visual tracking task as a similarity matching problem. Almost all popular Siamese trackers realize the similarity learning via convolutional feature cross-correlation between a target branch and a search branch. However, since the size of target feature region needs to be pre-fixed, these cross-correlation base methods suffer from either reserving much adverse background information or missing a great deal of foreground information. Moreover, the global matching between the target and search region also largely neglects the target structure and part-level information. In this paper, to solve the above issues, we propose a simple target-aware Siamese graph attention network for general object tracking. We propose to establish part-to-part correspondence between the target and the search region with a complete bipartite graph, and apply the graph attention mechanism to propagate target information from the template feature to the search feature. Further, instead of using the pre-fixed region cropping for template-feature-area selection, we investigate a target-aware area selection mechanism to fit the size and aspect ratio variations of different objects. Experiments on challenging benchmarks including GOT-10k, UAV123, OTB-100 and LaSOT demonstrate that the proposed SiamGAT outperforms many state-of-the-art trackers and achieves leading performance. Code is available at:


page 1

page 2

page 7


3D Siamese Transformer Network for Single Object Tracking on Point Clouds

Siamese network based trackers formulate 3D single object tracking as cr...

Spatio-Temporal Matching for Siamese Visual Tracking

Similarity matching is a core operation in Siamese trackers. Most Siames...

Correlation-Aware Deep Tracking

Robustness and discrimination power are two fundamental requirements in ...

SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks

Siamese network based trackers formulate tracking as convolutional featu...

GradNet: Gradient-Guided Network for Visual Object Tracking

The fully-convolutional siamese network based on template matching has s...

Scale Equivariance Improves Siamese Tracking

Siamese trackers turn tracking into similarity estimation between a temp...

SRRT: Search Region Regulation Tracking

Dominant trackers generate a fixed-size rectangular region based on the ...