TDT: Teaching Detectors to Track without Fully Annotated Videos

05/11/2022
by   Shuzhi Yu, et al.
0

Recently, one-stage trackers that use a joint model to predict both detections and appearance embeddings in one forward pass received much attention and achieved state-of-the-art results on the Multi-Object Tracking (MOT) benchmarks. However, their success depends on the availability of videos that are fully annotated with tracking data, which is expensive and hard to obtain. This can limit the model generalization. In comparison, the two-stage approach, which performs detection and embedding separately, is slower but easier to train as their data are easier to annotate. We propose to combine the best of the two worlds through a data distillation approach. Specifically, we use a teacher embedder, trained on Re-ID datasets, to generate pseudo appearance embedding labels for the detection datasets. Then, we use the augmented dataset to train a detector that is also capable of regressing these pseudo-embeddings in a fully-convolutional fashion. Our proposed one-stage solution matches the two-stage counterpart in quality but is 3 times faster. Even though the teacher embedder has not seen any tracking data during training, our proposed tracker achieves competitive performance with some popular trackers (e.g. JDE) trained with fully labeled tracking data.

READ FULL TEXT

page 8

page 14

research
02/02/2016

Simple Online and Realtime Tracking

This paper explores a pragmatic approach to multiple object tracking whe...
research
10/30/2020

SMOT: Single-Shot Multi Object Tracking

We present single-shot multi-object tracker (SMOT), a new tracking frame...
research
08/04/2022

SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset

Multi-object tracking (MOT) has been dominated by the use of track by de...
research
06/04/2020

Simple Unsupervised Multi-Object Tracking

Multi-object tracking has seen a lot of progress recently, albeit with s...
research
07/06/2021

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

Online tracking of multiple objects in videos requires strong capacity o...
research
07/24/2019

Teacher-Students Knowledge Distillation for Siamese Trackers

With the development of Siamese network based trackers, a variety of tec...

Please sign up or login with your details

Forgot password? Click here to reset