MOTR: End-to-End Multiple-Object Tracking with TRansformer

05/07/2021
by   Fangao Zeng, et al.
0

The key challenge in multiple-object tracking (MOT) task is temporal modeling of the object under track. Existing tracking-by-detection methods adopt simple heuristics, such as spatial or appearance similarity. Such methods, in spite of their commonality, are overly simple and insufficient to model complex variations, such as tracking through occlusion. Inherently, existing methods lack the ability to learn temporal variations from data. In this paper, we present MOTR, the first fully end-to-end multiple-object tracking framework. It learns to model the long-range temporal variation of the objects. It performs temporal association implicitly and avoids previous explicit heuristics. Built on Transformer and DETR, MOTR introduces the concept of "track query". Each track query models the entire track of an object. It is transferred and updated frame-by-frame to perform object detection and tracking, in a seamless manner. Temporal aggregation network combined with multi-frame training is proposed to model the long-range temporal relation. Experimental results show that MOTR achieves state-of-the-art performance. Code is available at https://github.com/megvii-model/MOTR.

READ FULL TEXT

page 3

page 7

page 8

research
12/31/2020

TransTrack: Multiple-Object Tracking with Transformer

Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...
research
06/10/2020

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model

Multi-object tracking is a fundamental vision problem that has been stud...
research
01/07/2021

TrackFormer: Multi-Object Tracking with Transformers

We present TrackFormer, an end-to-end multi-object tracking and segmenta...
research
07/28/2023

MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

As a video task, Multiple Object Tracking (MOT) is expected to capture t...
research
08/11/2023

Collaborative Tracking Learning for Frame-Rate-Insensitive Multi-Object Tracking

Multi-object tracking (MOT) at low frame rates can reduce computational,...
research
08/06/2022

Transformer-based assignment decision network for multiple object tracking

Data association is a crucial component for any multiple object tracking...
research
11/09/2021

Video Text Tracking With a Spatio-Temporal Complementary Model

Text tracking is to track multiple texts in a video,and construct a traj...

Please sign up or login with your details

Forgot password? Click here to reset