Tracking Objects as Pixel-wise Distributions

07/12/2022
by Zelin Zhao, et al.

Multi-object tracking (MOT) requires detecting objects and associating them across frames. Unlike methods that track detected bounding boxes or track objects as points, we propose tracking objects as pixel-wise distributions. We instantiate this idea in a transformer-based architecture, P3AFormer, with pixel-wise propagation, prediction, and association. P3AFormer propagates pixel-wise features guided by flow information to pass messages between frames. Furthermore, P3AFormer adopts a meta-architecture to produce multi-scale object feature maps. During inference, a pixel-wise association procedure recovers object connections across frames based on the pixel-wise predictions. P3AFormer achieves 81.2% MOTA on the MOT17 benchmark, making it the first transformer network in the literature to reach 80% MOTA. P3AFormer also outperforms state-of-the-art methods on the MOT20 and KITTI benchmarks.
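
For readers unfamiliar with the headline metric, MOTA (Multiple Object Tracking Accuracy) is conventionally defined as one minus the ratio of summed errors (false negatives, false positives, and identity switches) to the total number of ground-truth objects over all frames. The sketch below illustrates that standard definition only; the function name and the example counts are hypothetical and are not taken from the paper or from any benchmark toolkit.

```python
def mota(false_negatives: int, false_positives: int,
         id_switches: int, num_gt_objects: int) -> float:
    """Compute MOTA from error counts summed over all frames.

    MOTA = 1 - (FN + FP + IDSW) / GT, where GT is the total number of
    ground-truth object instances across the sequence.
    """
    if num_gt_objects <= 0:
        raise ValueError("num_gt_objects must be positive")
    total_errors = false_negatives + false_positives + id_switches
    return 1.0 - total_errors / num_gt_objects


# Hypothetical counts, chosen only to illustrate the arithmetic.
score = mota(false_negatives=10, false_positives=5,
             id_switches=5, num_gt_objects=100)
print(f"MOTA = {score:.1%}")
```

Because the errors are summed before dividing, MOTA can be negative when a tracker makes more errors than there are ground-truth objects.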

research
11/20/2017

Pixel-wise object tracking

In this paper, we propose a novel pixel-wise visual object tracking fram...
research
08/18/2022

Pixel-Wise Prediction based Visual Odometry via Uncertainty Estimation

This paper introduces pixel-wise prediction based visual odometry (PWVO)...
research
10/17/2022

Track Targets by Dense Spatio-Temporal Position Encoding

In this work, we propose a novel paradigm to encode the position of targ...
research
05/08/2022

Transformer Tracking with Cyclic Shifting Window Attention

Transformer architecture has been showing its great strength in visual o...
research
09/04/2018

OCNet: Object Context Network for Scene Parsing

Context is essential for various computer vision tasks. The state-of-the...
research
07/09/2023

Reducing False Alarms in Video Surveillance by Deep Feature Statistical Modeling

Detecting relevant changes is a fundamental problem of video surveillanc...
research
06/28/2021

Prior-Induced Information Alignment for Image Matting

Image matting is an ill-posed problem that aims to estimate the opacity ...
