Motion-inductive Self-supervised Object Discovery in Videos

10/01/2022
by   Shuangrui Ding, et al.
7

In this paper, we consider the task of unsupervised object discovery in videos. Previous works have shown promising results via processing optical flows to segment objects. However, taking flow as input brings about two drawbacks. First, flow cannot capture sufficient cues when objects remain static or partially occluded. Second, it is challenging to establish temporal coherency from flow-only input, due to the missing texture information. To tackle these limitations, we propose a model for directly processing consecutive RGB frames, and infer the optical flow between any pair of frames using a layered representation, with the opacity channels being treated as the segmentation. Additionally, to enforce object permanence, we apply temporal consistency loss on the inferred masks from randomly-paired frames, which refer to the motions at different paces, and encourage the model to segment the objects even if they may not move at the current time point. Experimentally, we demonstrate superior performance over previous state-of-the-art methods on three public video segmentation datasets (DAVIS2016, SegTrackv2, and FBMS-59), while being computationally efficient by avoiding the overhead of computing optical flow as input.

READ FULL TEXT

page 2

page 3

page 7

page 14

page 15

research
04/15/2021

Self-supervised Video Object Segmentation by Motion Grouping

Animals have evolved highly functional visual systems to understand moti...
research
08/31/2023

STint: Self-supervised Temporal Interpolation for Geospatial Data

Supervised and unsupervised techniques have demonstrated the potential f...
research
03/08/2023

TSANET: Temporal and Scale Alignment for Unsupervised Video Object Segmentation

Unsupervised Video Object Segmentation (UVOS) refers to the challenging ...
research
02/11/2020

Self-Supervised Object-in-Gripper Segmentation from Robotic Motions

We present a novel technique to automatically generate annotated data fo...
research
09/08/2022

Unsupervised Video Object Segmentation via Prototype Memory Network

Unsupervised video object segmentation aims to segment a target object i...
research
07/05/2022

Segmenting Moving Objects via an Object-Centric Layered Representation

The objective of this paper is a model that is able to discover, track a...
research
12/03/2020

Learning to Transfer Visual Effects from Videos to Images

We study the problem of animating images by transferring spatio-temporal...

Please sign up or login with your details

Forgot password? Click here to reset