DeepAI AI Chat
Log In Sign Up

Keypoints Tracking via Transformer Networks

by   Oleksii Nasypanyi, et al.

In this thesis, we propose a pioneering work on sparse keypoints tracking across images using transformer networks. While deep learning-based keypoints matching have been widely investigated using graph neural networks - and more recently transformer networks, they remain relatively too slow to operate in real-time and are particularly sensitive to the poor repeatability of the keypoints detectors. In order to address these shortcomings, we propose to study the particular case of real-time and robust keypoints tracking. Specifically, we propose a novel architecture which ensures a fast and robust estimation of the keypoints tracking between successive images of a video sequence. Our method takes advantage of a recent breakthrough in computer vision, namely, visual transformer networks. Our method consists of two successive stages, a coarse matching followed by a fine localization of the keypoints' correspondences prediction. Through various experiments, we demonstrate that our approach achieves competitive results and demonstrates high robustness against adverse conditions, such as illumination change, occlusion and viewpoint differences.


page 1

page 5

page 6


Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

In video object tracking, there exist rich temporal contexts among succe...

Siamese Transformer Pyramid Networks for Real-Time UAV Tracking

Recent object tracking methods depend upon deep networks or convoluted a...

DAGformer: Directed Acyclic Graph Transformer

In many fields, such as natural language processing and computer vision,...

Residual Swin Transformer Channel Attention Network for Image Demosaicing

Image demosaicing is problem of interpolating full- resolution color ima...

SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking

The greatest challenge facing visual object tracking is the simultaneous...

Joint Deep Multi-Graph Matching and 3D Geometry Learning from Inhomogeneous 2D Image Collections

Graph matching aims to establish correspondences between vertices of gra...

Code Repositories


Keypoints Tracking via Transformer Networks

view repo