Tracking Objects as Points

04/02/2020
by   Xingyi Zhou, et al.
0

Tracking has traditionally been the art of following interest points through space and time. This changed with the rise of powerful deep networks. Nowadays, tracking is dominated by pipelines that perform object detection followed by temporal association, also known as tracking-by-detection. In this paper, we present a simultaneous detection and tracking algorithm that is simpler, faster, and more accurate than the state of the art. Our tracker, CenterTrack, applies a detection model to a pair of images and detections from the prior frame. Given this minimal input, CenterTrack localizes objects and predicts their associations with the previous frame. That's it. CenterTrack is simple, online (no peeking into the future), and real-time. It achieves 67.3 the MOT17 challenge at 22 FPS and 89.4 15 FPS, setting a new state of the art on both datasets. CenterTrack is easily extended to monocular 3D tracking by regressing additional 3D attributes. Using monocular video input, it achieves 28.3 nuScenes 3D tracking benchmark, substantially outperforming the monocular baseline on this benchmark while running at 28 FPS.

READ FULL TEXT

page 2

page 14

research
04/12/2021

Localization-Based Tracking

End-to-end production of object tracklets from high resolution video in ...
research
05/30/2022

Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving

While separately leveraging monocular 3D object detection and 2D multi-o...
research
10/11/2017

Detect to Track and Track to Detect

Recent approaches for high accuracy detection and tracking of object cat...
research
01/25/2015

An Occlusion Reasoning Scheme for Monocular Pedestrian Tracking in Dynamic Scenes

This paper looks into the problem of pedestrian tracking using a monocul...
research
05/06/2021

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark

To promote the developments of object detection, tracking and counting a...
research
10/29/2021

Multi-target tracking for video surveillance using deep affinity network: a brief review

Deep learning models are known to function like the human brain. Due to ...
research
12/08/2021

Tracking People by Predicting 3D Appearance, Location Pose

In this paper, we present an approach for tracking people in monocular v...

Please sign up or login with your details

Forgot password? Click here to reset