Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos

06/05/2022
by   Sukjun Hwang, et al.
0

Recently, both long-tailed recognition and object tracking have made great advances individually. TAO benchmark presented a mixture of the two, long-tailed object tracking, in order to further reflect the aspect of the real-world. To date, existing solutions have adopted detectors showing robustness in long-tailed distributions, which derive per-frame results. Then, they used tracking algorithms that combine the temporally independent detections to finalize tracklets. However, as the approaches did not take temporal changes in scenes into account, inconsistent classification results in videos led to low overall performance. In this paper, we present a set classifier that improves accuracy of classifying tracklets by aggregating information from multiple viewpoints contained in a tracklet. To cope with sparse annotations in videos, we further propose augmentation of tracklets that can maximize data efficiency. The set classifier is plug-and-playable to existing object trackers, and highly improves the performance of long-tailed object tracking. By simply attaching our method to QDTrack on top of ResNet-101, we achieve the new state-of-the-art, 19.9 TAO validation and test sets, respectively.

READ FULL TEXT

page 1

page 3

page 5

research
12/04/2017

Long-Term Visual Object Tracking Benchmark

In this paper, we propose a new long video dataset (called Track Long an...
research
03/19/2020

MOT20: A benchmark for multi object tracking in crowded scenes

Standardized benchmarks are crucial for the majority of computer vision ...
research
01/09/2023

EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset

Visual object tracking is a key component to many egocentric vision prob...
research
01/27/2020

ABCTracker: an easy-to-use, cloud-based application for tracking multiple objects

Visual multi-object tracking has the potential to accelerate many forms ...
research
06/21/2021

Multiple Object Tracking with Mixture Density Networks for Trajectory Estimation

Multiple object tracking faces several challenges that may be alleviated...
research
02/02/2022

Does Video Compression Impact Tracking Accuracy?

Everyone "knows" that compressing a video will degrade the accuracy of o...

Please sign up or login with your details

Forgot password? Click here to reset