RPM-Net: Robust Pixel-Level Matching Networks for Self-Supervised Video Object Segmentation

09/29/2019
by   Youngeun Kim, et al.
13

In this paper, we introduce a self-supervised approach for video object segmentation without human labeled data.Specifically, we present Robust Pixel-level Matching Net-works (RPM-Net), a novel deep architecture that matches pixels between adjacent frames, using only color information from unlabeled videos for training. Technically, RPM-Net can be separated in two main modules. The embed-ding module first projects input images into high dimensional embedding space. Then the matching module with deformable convolution layers matches pixels between reference and target frames based on the embedding features.Unlike previous methods using deformable convolution, our matching module adopts deformable convolution to focus on similar features in spatio-temporally neighboring pixels.Our experiments show that the selective feature sampling improves the robustness to challenging problems in video object segmentation such as camera shake, fast motion, deformation, and occlusion. Also, we carry out comprehensive experiments on three public datasets (i.e., DAVIS-2017,SegTrack-v2, and Youtube-Objects) and achieve state-of-the-art performance on self-supervised video object seg-mentation. Moreover, we significantly reduce the performance gap between self-supervised and fully-supervised video object segmentation (41.0 validation set)

READ FULL TEXT

page 1

page 2

page 3

page 6

page 8

page 9

research
09/18/2020

PMVOS: Pixel-Level Matching-Based Video Object Segmentation

Semi-supervised video object segmentation (VOS) aims to segment arbitrar...
research
02/14/2022

Box Supervised Video Segmentation Proposal Network

Video Object Segmentation (VOS) has been targeted by various fully-super...
research
10/04/2021

Pixel-Level Bijective Matching for Video Object Segmentation

Semi-supervised video object segmentation (VOS) aims to track the design...
research
10/28/2019

Self-supervised learning of class embeddings from video

This work explores how to use self-supervised learning on videos to lear...
research
05/02/2022

Boosting Video Object Segmentation based on Scale Inconsistency

We present a refinement framework to boost the performance of pre-traine...
research
03/02/2017

LSB Matching Steganalysis Based on Patterns of Pixel Differences and Random Embedding

This paper presents a novel method for detection of LSB matching stegano...
research
04/22/2022

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging

We propose a novel self-supervised Video Object Segmentation (VOS) appro...

Please sign up or login with your details

Forgot password? Click here to reset