Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation

08/04/2020
by   Mingmin Zhen, et al.
3

In this paper, we introduce a novel network, called discriminative feature network (DFNet), to address the unsupervised video object segmentation task. To capture the inherent correlation among video frames, we learn discriminative features (D-features) from the input images that reveal feature distribution from a global perspective. The D-features are then used to establish correspondence with all features of test image under conditional random field (CRF) formulation, which is leveraged to enforce consistency between pixels. The experiments verify that DFNet outperforms state-of-the-art methods by a large margin with a mean IoU score of 83.4 leaderboard while using much fewer parameters and achieving much more efficient performance in the inference phase. We further evaluate DFNet on the FBMS dataset and the video saliency dataset ViSal, reaching a new state-of-the-art. To further demonstrate the generalizability of our framework, DFNet is also applied to the image object co-segmentation task. We perform experiments on a challenging dataset PASCAL-VOC and observe the superiority of DFNet. The thorough experiments verify that DFNet is able to capture and mine the underlying relations of images and discover the common foreground objects.

READ FULL TEXT

page 3

page 10

page 11

page 12

research
01/19/2020

See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks

We introduce a novel network, called CO-attention Siamese Network (COSNe...
research
01/19/2020

Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks

This work proposes a novel attentive graph neural network (AGNN) for zer...
research
05/21/2020

Unsupervised segmentation via semantic-apparent feature fusion

Foreground segmentation is an essential task in the field of image under...
research
04/07/2017

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits

Estimating correspondence between two images and extracting the foregrou...
research
06/12/2016

Human Centred Object Co-Segmentation

Co-segmentation is the automatic extraction of the common semantic regio...
research
01/29/2018

End-to-End Fine-Grained Action Segmentation and Recognition Using Conditional Random Field Models and Discriminative Sparse Coding

Fine-grained action segmentation and recognition is an important yet cha...
research
02/16/2016

Segmentation Rectification for Video Cutout via One-Class Structured Learning

Recent works on interactive video object cutout mainly focus on designin...

Please sign up or login with your details

Forgot password? Click here to reset