1st Place Solution for CVPR2023 BURST Long Tail and Open World Challenges

08/08/2023
by   Kaer Huang, et al.

Video Instance Segmentation (VIS) currently aims to segment and categorize objects in videos from a closed set of training categories that contains only a few dozen categories, so it cannot handle the diverse objects found in real-world videos. With the release of the TAO and BURST datasets, we have the opportunity to study VIS in long-tailed and open-world scenarios. Traditional VIS methods are evaluated on benchmarks limited to a small number of common classes, but practical applications require trackers that go beyond these common classes, detecting and tracking rare and even never-before-seen objects. Inspired by a recent MOT paper on the long-tail task (Tracking Every Thing in the Wild, Siyuan Li et al.), for the BURST long-tail challenge we train our model on a combination of LVISv0.5 and the COCO dataset using repeat factor sampling. First, we train the detector with segmentation and CEM on the LVISv0.5 + COCO dataset. Then, we train the instance appearance similarity head on the TAO dataset. Our method (LeTracker) achieves 14.9 HOTAall on the BURST test set, ranking 1st in the benchmark. For the open-world challenge, we train using only the annotations of 64 classes (the intersection of the classes in the BURST train subset and the COCO dataset, without the LVIS dataset), test on the BURST test set, and obtain 61.4 OWTAall, ranking 1st in the benchmark. Our code will be released to facilitate future research.
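
The abstract mentions repeat factor sampling without giving details. The following is a minimal sketch of the technique as it is commonly defined for LVIS-style long-tailed training, not the authors' implementation: each category c gets a repeat factor max(1, sqrt(t / f_c)), where f_c is the fraction of training images containing c, and each image is repeated according to the largest factor among its categories. The function names, the input layout (one set of category ids per image), the threshold value t, and the stochastic rounding of fractional factors are all assumptions made for illustration.

```python
import math
import random
from collections import Counter


def repeat_factors(image_categories, threshold=0.001):
    """Per-image repeat factors (hypothetical helper, not from the paper).

    image_categories: list with one set of category ids per training image.
    threshold: frequency t below which a category is oversampled (assumed value).
    """
    num_images = len(image_categories)
    # Count, for each category, the number of images that contain it.
    counts = Counter(c for cats in image_categories for c in set(cats))
    # Category-level factor: max(1, sqrt(t / f_c)), f_c = image frequency of c.
    cat_rep = {
        c: max(1.0, math.sqrt(threshold / (n / num_images)))
        for c, n in counts.items()
    }
    # Image-level factor: the largest factor among the image's categories.
    return [
        max((cat_rep[c] for c in cats), default=1.0)
        for cats in image_categories
    ]


def sample_epoch(rep_factors, seed=0):
    """Build one epoch of image indices from the repeat factors.

    The fractional part of each factor is rounded stochastically, which is
    one common choice (an assumption here).
    """
    rng = random.Random(seed)
    indices = []
    for idx, r in enumerate(rep_factors):
        whole = int(r)
        extra = 1 if rng.random() < (r - whole) else 0
        indices.extend([idx] * (whole + extra))
    rng.shuffle(indices)
    return indices
```

With this sketch, an image that contains a rare category appears several times per epoch, while images that contain only frequent categories appear once, which is the rebalancing effect needed when mixing LVIS's long-tailed categories with COCO's common ones.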

research
04/04/2023

Towards Open-Vocabulary Video Instance Segmentation

Video Instance Segmentation (VIS) aims at segmenting and categorizing obj...
research
02/03/2022

The Met Dataset: Instance-level Recognition for Artworks

This work introduces a dataset for large-scale instance-level recognitio...
research
07/23/2020

The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation

Most existing object instance detection and segmentation models only wor...
research
04/02/2021

Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision

Instance segmentation is an active topic in computer vision that is usua...
research
04/29/2019

A Study on Action Detection in the Wild

The recent introduction of the AVA dataset for action detection has caus...
research
04/10/2021

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation

Current state-of-the-art object detection and segmentation methods work ...
research
10/29/2019

Classification Calibration for Long-tail Instance Segmentation

Remarkable progress has been made in object instance detection and segme...
