Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

04/17/2023
by   Long Lian, et al.
0

We study learning object segmentation from unlabeled videos. Humans can easily segment moving objects without knowing what they are. The Gestalt law of common fate, i.e., what move at the same speed belong together, has inspired unsupervised object discovery based on motion segmentation. However, common fate is not a reliable indicator of objectness: Parts of an articulated / deformable object may not move at the same speed, whereas shadows / reflections of an object always move with it but are not part of it. Our insight is to bootstrap objectness by first learning image features from relaxed common fate and then refining them based on visual appearance grouping within the image itself and across images statistically. Specifically, we learn an image segmenter first in the loop of approximating optical flow with constant segment flow plus small within-segment residual flow, and then by refining it for more coherent appearance and statistical figure-ground relevance. On unsupervised video object segmentation, using only ResNet and convolutional heads, our model surpasses the state-of-the-art by absolute gains of 7/9/5 effectiveness of our ideas. Our code is publicly available.

READ FULL TEXT

page 1

page 5

page 8

page 13

page 14

page 15

page 16

research
11/11/2021

The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos

Humans can easily segment moving objects without knowing what they are. ...
research
05/16/2022

Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion

Motion, measured via optical flow, provides a powerful cue to discover a...
research
12/01/2017

Learning to Segment Moving Objects

We study the problem of segmenting moving objects in unconstrained video...
research
12/19/2016

Learning Features by Watching Objects Move

This paper presents a novel yet intuitive approach to unsupervised featu...
research
12/17/2022

Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy

We present IMAS, a method that segments the primary objects in videos wi...
research
08/22/2023

LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training

Learning object segmentation in image and video datasets without human s...
research
07/25/2023

Optical Flow boosts Unsupervised Localization and Segmentation

Unsupervised localization and segmentation are long-standing robot visio...

Please sign up or login with your details

Forgot password? Click here to reset