Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

03/14/2021
by   Ho Kei Cheng, et al.
23

We present Modular interactive VOS (MiVOS) framework which decouples interaction-to-mask and mask propagation, allowing for higher generalizability and better performance. Trained separately, the interaction module converts user interactions to an object mask, which is then temporally propagated by our propagation module using a novel top-k filtering strategy in reading the space-time memory. To effectively take the user's intent into account, a novel difference-aware module is proposed to learn how to properly fuse the masks before and after each interaction, which are aligned with the target frames by employing the space-time memory. We evaluate our method both qualitatively and quantitatively with different forms of user interactions (e.g., scribbles, clicks) on DAVIS to show that our method outperforms current state-of-the-art algorithms while requiring fewer frame interactions, with the additional advantage in generalizing to different types of user interactions. We contribute a large-scale synthetic VOS dataset with pixel-accurate segmentation of 4.8M frames to accompany our source codes to facilitate future research.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 8

research
03/03/2022

Revisiting Click-based Interactive Video Object Segmentation

While current methods for interactive Video Object Segmentation (iVOS) r...
research
04/21/2021

Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps

We propose a novel guided interactive segmentation (GIS) algorithm for v...
research
08/26/2023

PE-MED: Prompt Enhancement for Interactive Medical Image Segmentation

Interactive medical image segmentation refers to the accurate segmentati...
research
04/01/2019

Video Object Segmentation using Space-Time Memory Networks

We propose a novel solution for semi-supervised video object segmentatio...
research
11/29/2021

MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation

The task of semi-supervised video object segmentation (VOS) has been gre...
research
03/30/2020

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

Interactive video object segmentation (iVOS) aims at efficiently harvest...
research
04/13/2023

Boosting Video Object Segmentation via Space-time Correspondence Learning

Current top-leading solutions for video object segmentation (VOS) typica...

Please sign up or login with your details

Forgot password? Click here to reset