MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking

05/23/2023
by   En Yu, et al.
0

Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer from the conflict between detection and association seriously, resulting in unsatisfactory convergence dynamics. While MOTRv2 partly addresses this problem, it demands an additional detection network for assistance. In this work, we serve as the first to reveal that this conflict arises from the unfair label assignment between detect queries and track queries during training, where these detect queries recognize targets and track queries associate them. Based on this observation, we propose MOTRv3, which balances the label assignment process using the developed release-fetch supervision strategy. In this strategy, labels are first released for detection and gradually fetched back for association. Besides, another two strategies named pseudo label distillation and track group denoising are designed to further improve the supervision for detection and association. Without the assistance of an extra detection network during inference, MOTRv3 achieves impressive performance across diverse benchmarks, e.g., MOT17, DanceTrack.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2023

Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking

Existing end-to-end Multi-Object Tracking (e2e-MOT) methods have not sur...
research
11/17/2022

MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

In this paper, we propose MOTRv2, a simple yet effective pipeline to boo...
research
07/26/2022

Group DETR: Fast Training Convergence with Decoupled One-to-Many Label Assignment

Detection Transformer (DETR) relies on One-to-One label assignment, i.e....
research
08/06/2022

Transformer-based assignment decision network for multiple object tracking

Data association is a crucial component for any multiple object tracking...
research
11/25/2022

DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection

Fully convolutional detectors discard the one-to-many assignment and ado...
research
06/08/2023

SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Exploring robust and efficient association methods has always been an im...

Please sign up or login with your details

Forgot password? Click here to reset