VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

08/28/2023
by   Xudong Wang, et al.
0

Existing approaches to unsupervised video instance segmentation typically rely on motion estimates and experience difficulties tracking small or divergent motions. We present VideoCutLER, a simple method for unsupervised multi-instance video segmentation without using motion-based learning signals like optical flow or training on natural videos. Our key insight is that using high-quality pseudo masks and a simple video synthesis method for model training is surprisingly sufficient to enable the resulting video model to effectively segment and track multiple instances across video frames. We show the first competitive unsupervised learning results on the challenging YouTubeVIS-2019 benchmark, achieving 50.7 state-of-the-art by a large margin. VideoCutLER can also serve as a strong pretrained model for supervised video instance segmentation tasks, exceeding DINO by 15.9

READ FULL TEXT

page 1

page 2

page 6

page 11

page 12

page 13

page 14

page 15

research
12/13/2018

Design Pseudo Ground Truth with Motion Cue for Unsupervised Video Object Segmentation

One major technique debt in video object segmentation is to label the ob...
research
03/25/2023

UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes

3D instance segmentation is fundamental to geometric understanding of th...
research
12/19/2018

Unsupervised Video Object Segmentation with Distractor-Aware Online Adaptation

Unsupervised video object segmentation is a crucial application in video...
research
12/15/2022

Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration

Instance segmentation in videos, which aims to segment and track multipl...
research
02/25/2022

Weakly Supervised Instance Segmentation using Motion Information via Optical Flow

Weakly supervised instance segmentation has gained popularity because it...
research
12/19/2019

Learning a Spatio-Temporal Embedding for Video Instance Segmentation

We present a novel embedding approach for video instance segmentation. O...
research
12/12/2013

Unsupervised learning of depth and motion

We present a model for the joint estimation of disparity and motion. The...

Please sign up or login with your details

Forgot password? Click here to reset