Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

07/27/2021
by   Bo Miao, et al.

We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for semi-supervised video object segmentation. During training, MAMP uses a frame reconstruction task to train the model without any annotations. During inference, MAMP extracts high-resolution features from each frame and builds a memory bank from the features and predicted masks of selected past frames. MAMP then propagates the masks from the memory bank to subsequent frames using the proposed motion-aware spatio-temporal matching module. Evaluations on the DAVIS-2017 and YouTube-VOS datasets show that MAMP achieves state-of-the-art performance and generalizes better than existing self-supervised methods: it attains 4.9% higher mean 𝒥&ℱ on DAVIS-2017 and 4.85% higher mean 𝒥&ℱ on the unseen categories of YouTube-VOS than the nearest competitor. Moreover, MAMP performs on par with many supervised video object segmentation methods. Our code is available at: <https://github.com/bo-miao/MAMP>.
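
The idea of propagating masks from a memory bank by feature matching can be illustrated with a minimal sketch. This is not the authors' implementation: it shows only generic affinity-based label propagation and omits the motion-aware component entirely. The function name `propagate_mask`, the flattened array shapes, and the `temperature` parameter are assumptions made for illustration.

```python
import numpy as np

def propagate_mask(query_feat, memory_feats, memory_masks, temperature=1.0):
    """Propagate soft masks from memory pixels to query pixels.

    Hypothetical shapes (assumed for this sketch):
      query_feat:   (N, C) features of the N pixels in the current frame
      memory_feats: (M, C) features of the M pixels stored in the memory bank
      memory_masks: (M, K) one-hot or soft object labels of the memory pixels
    Returns an (N, K) array of soft labels for the current frame.
    """
    # Dot-product affinity between every query pixel and every memory pixel.
    affinity = (query_feat @ memory_feats.T) / temperature  # (N, M)
    # Numerically stable softmax over the memory dimension.
    affinity = affinity - affinity.max(axis=1, keepdims=True)
    weights = np.exp(affinity)
    weights = weights / weights.sum(axis=1, keepdims=True)
    # Each query label is an affinity-weighted average of the memory labels.
    return weights @ memory_masks  # (N, K)
```

In such a scheme, taking the argmax over the last axis of the result would give a hard mask for the current frame, which could then be written back into the memory bank together with that frame's features.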

research
07/21/2022

Region Aware Video Object Segmentation with Deep Motion Modeling

Current semi-supervised video object segmentation (VOS) methods usually ...
research
07/16/2022

Learning Quality-aware Dynamic Memory for Video Object Segmentation

Recently, several spatial-temporal memory-based methods have verified th...
research
08/08/2021

Joint Inductive and Transductive Learning for Video Object Segmentation

Semi-supervised video object segmentation is a task of segmenting the ta...
research
09/23/2021

Hierarchical Memory Matching Network for Video Object Segmentation

We present Hierarchical Memory Matching Network (HMMN) for semi-supervis...
research
07/26/2021

Efficient Video Object Segmentation with Compressed Video

We propose an efficient inference framework for semi-supervised video ob...
research
11/29/2021

MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation

The task of semi-supervised video object segmentation (VOS) has been gre...
research
05/08/2022

Recurrent Dynamic Embedding for Video Object Segmentation

Space-time memory (STM) based video object segmentation (VOS) networks u...
