Tracking Anything in High Quality

07/26/2023
by Jiawen Zhu, et al.

Visual object tracking is a fundamental video task in computer vision. Recently, the notably increasing power of perception algorithms has allowed the unification of single/multi-object and box/mask-based tracking. Among these algorithms, the Segment Anything Model (SAM) has attracted much attention. In this report, we propose HQTrack, a framework for High Quality Tracking of anything in videos. HQTrack mainly consists of a video multi-object segmenter (VMOS) and a mask refiner (MR). Given the object to be tracked in the initial frame of a video, VMOS propagates the object masks to the current frame. The mask results at this stage are not accurate enough, since VMOS is trained on several closed-set video object segmentation (VOS) datasets and therefore has limited ability to generalize to complex and corner-case scenes. To further improve the quality of the tracking masks, a pretrained MR model is employed to refine the tracking results. As a compelling testament to the effectiveness of our paradigm, without employing any tricks such as test-time data augmentation or model ensembling, HQTrack ranks 2nd in the Visual Object Tracking and Segmentation (VOTS2023) challenge. Code and models are available at https://github.com/jiawen-zhu/HQTrack.
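The two-stage flow described in the abstract (propagate masks from the initial frame, then refine each propagated mask) can be illustrated with a minimal Python sketch. This is not the HQTrack code: `track_video`, `copy_propagate`, and `clip_to_foreground` are hypothetical names, the two callables are toy stand-ins for VMOS and MR, and feeding the coarse (unrefined) mask forward to the next frame is an assumption, since the abstract does not say which mask is propagated.

```python
from typing import Callable, List, Sequence

Frame = List[List[int]]  # toy frame: 2D grid of pixel values
Mask = List[List[int]]   # 2D grid of object IDs; 0 means background


def track_video(
    frames: Sequence[Frame],
    init_mask: Mask,
    propagate: Callable[[Frame, Mask], Mask],
    refine: Callable[[Frame, Mask], Mask],
) -> List[Mask]:
    """Propagate the initial mask frame by frame, refining each result."""
    results = [init_mask]  # the first frame's mask is given
    prev_coarse = init_mask
    for frame in frames[1:]:
        coarse = propagate(frame, prev_coarse)  # stand-in for VMOS
        results.append(refine(frame, coarse))   # stand-in for MR
        # Assumption: the coarse mask, not the refined one, is carried forward.
        prev_coarse = coarse
    return results


def copy_propagate(frame: Frame, prev_mask: Mask) -> Mask:
    """Toy propagator: reuse the previous mask unchanged."""
    return [row[:] for row in prev_mask]


def clip_to_foreground(frame: Frame, coarse: Mask) -> Mask:
    """Toy refiner: keep an object ID only where the frame pixel is nonzero."""
    return [
        [m if p else 0 for p, m in zip(frame_row, mask_row)]
        for frame_row, mask_row in zip(frame, coarse)
    ]


frames = [[[1, 1], [0, 1]], [[1, 0], [0, 1]]]
init_mask = [[1, 1], [0, 0]]
masks = track_video(frames, init_mask, copy_propagate, clip_to_foreground)
```

Passing the segmenter and refiner in as callables keeps the loop independent of any particular model, which mirrors the abstract's description of MR as a separately pretrained component applied on top of the VMOS output.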



research
07/05/2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking

The Associating Objects with Transformers (AOT) framework has exhibited ...
research
08/25/2023

Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation

Tracking any given object(s) spatially and temporally is a common purpos...
research
05/22/2023

UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model

Unsupervised video object segmentation has made significant progress in ...
research
12/27/2022

1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation

The task of referring video object segmentation aims to segment the obje...
research
10/25/2019

Learning to Track Any Object

Object tracking can be formulated as "finding the right object in a vide...
research
06/01/2022

Differentiable Soft-Masked Attention

Transformers have become prevalent in computer vision due to their perfo...
research
11/13/2020

Image Animation with Perturbed Masks

We present a novel approach for image-animation of a source image by a d...
