
Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking

by ShiJie Sun, et al.

Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection. This results in deep models that are detector biased and evaluations that are detector influenced. To resolve this issue, we introduce the Deep Motion Modeling Network (DMM-Net), which estimates the motion parameters of multiple objects to perform joint detection and association in an end-to-end manner. DMM-Net models object features over multiple frames and simultaneously infers object classes, visibility, and motion parameters. These outputs are readily used to update the tracklets for efficient MOT. DMM-Net achieves a PR-MOTA score of 12.80 at 120+ fps on the popular UA-DETRAC challenge, outperforming prior methods while running orders of magnitude faster. We also contribute Omni-MOT, a large-scale synthetic public dataset for vehicle tracking that provides precise ground-truth annotations to eliminate the detector influence in MOT evaluation. This dataset of 14M+ frames can be extended with our public script (the dataset, the dataset recorder, and the Omni-MOT source are released publicly). We demonstrate the suitability of Omni-MOT for deep learning with DMM-Net and also make the source code of our network public.
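
To make the idea of updating tracklets from motion parameters concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each object's motion is given as one quadratic polynomial in time per box coordinate, that a predicted clip overlaps the existing track by one frame, and that association is a greedy IoU match. The abstract specifies none of these details, and the names `decode_boxes`, `iou`, and `update_tracklets` are hypothetical.

```python
import numpy as np

def decode_boxes(params, times):
    """Turn motion parameters into boxes.

    params: (4, 3) array, ASSUMED to hold quadratic coefficients (a, b, c)
            for each of the box coordinates (cx, cy, w, h).
    Returns an array of shape (len(times), 4) with a*t^2 + b*t + c per coordinate.
    """
    t = np.asarray(times, dtype=float)
    basis = np.stack([t ** 2, t, np.ones_like(t)], axis=1)  # shape (T, 3)
    return basis @ np.asarray(params, dtype=float).T         # shape (T, 4)

def iou(box_a, box_b):
    """Intersection over union of two (cx, cy, w, h) boxes."""
    ax1, ay1 = box_a[0] - box_a[2] / 2, box_a[1] - box_a[3] / 2
    ax2, ay2 = box_a[0] + box_a[2] / 2, box_a[1] + box_a[3] / 2
    bx1, by1 = box_b[0] - box_b[2] / 2, box_b[1] - box_b[3] / 2
    bx2, by2 = box_b[0] + box_b[2] / 2, box_b[1] + box_b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0

def update_tracklets(tracklets, tubes, times, iou_threshold=0.5):
    """Greedily attach predicted tubes to tracklets (hypothetical scheme).

    tracklets: dict mapping track id -> list of (cx, cy, w, h) boxes.
    tubes:     list of (4, 3) motion-parameter arrays for the new clip.
    times:     frame times of the clip; times[0] is ASSUMED to coincide
               with the last frame already stored in each tracklet.
    """
    next_id = max(tracklets, default=-1) + 1
    unmatched = set(tracklets)
    for params in tubes:
        boxes = decode_boxes(params, times)
        best_id, best_iou = None, iou_threshold
        for track_id in unmatched:
            score = iou(tracklets[track_id][-1], boxes[0])
            if score > best_iou:
                best_id, best_iou = track_id, score
        if best_id is None:
            # No sufficiently overlapping tracklet: start a new one.
            tracklets[next_id] = list(boxes)
            next_id += 1
        else:
            # Skip the overlapping first frame, append the rest.
            unmatched.discard(best_id)
            tracklets[best_id].extend(boxes[1:])
    return tracklets
```

Because each object is described by a handful of coefficients rather than per-frame detections, a whole clip can be decoded with a single matrix product. This is one plausible reading of why a parametric motion output lends itself to fast tracklet updates.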

