Learning What to Learn for Video Object Segmentation

03/25/2020
by Goutam Bhat, et al.

Video object segmentation (VOS) is a highly challenging problem, since the target object is only defined during inference with a given first-frame reference mask. The problem of how to capture and utilize this limited target information remains a fundamental research question. We address this by introducing an end-to-end trainable VOS architecture that integrates a differentiable few-shot learning module. This internal learner is designed to predict a powerful parametric model of the target by minimizing a segmentation error in the first frame. We further go beyond standard few-shot learning techniques by learning what the few-shot learner should learn. This allows us to achieve a rich internal representation of the target in the current frame, significantly increasing the segmentation accuracy of our approach. We perform extensive experiments on multiple benchmarks. Our approach sets a new state-of-the-art on the large-scale YouTube-VOS 2018 dataset by achieving an overall score of 81.5, corresponding to a 2.6% relative improvement over the previous best result.
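The idea of an internal learner that fits a target model by minimizing a first-frame segmentation error can be illustrated with a small sketch. This is not the architecture from the paper: it assumes a linear per-pixel model on precomputed features, a ridge-regularized squared error against the raw mask, and a few unrolled steepest-descent steps with an exact step length (one common way to keep such a learner differentiable). The function name `fit_target_model` and the parameters `num_iters` and `lam` are hypothetical.

```python
import numpy as np


def fit_target_model(feats, mask, num_iters=5, lam=1e-2):
    """Fit linear weights w so that feats @ w approximates the first-frame mask.

    feats: (N, D) array, one feature vector per first-frame pixel (assumed given).
    mask:  (N,) array of 0/1 labels from the first-frame reference mask.
    Minimizes ||feats @ w - mask||^2 + lam * ||w||^2 with a fixed number of
    steepest-descent steps, so the whole procedure is a short chain of
    differentiable operations.
    """
    w = np.zeros(feats.shape[1])
    losses = []
    for _ in range(num_iters):
        residual = feats @ w - mask
        losses.append(float(residual @ residual + lam * (w @ w)))
        grad = 2.0 * (feats.T @ residual + lam * w)
        # Exact step length for a quadratic loss: |g|^2 / (g^T H g),
        # where H = 2 * (feats^T feats + lam * I).
        curvature = 2.0 * (np.sum((feats @ grad) ** 2) + lam * (grad @ grad))
        if curvature <= 0.0:
            break  # gradient is zero, already at the minimizer
        w = w - (grad @ grad) / curvature * grad
    return w, losses


# Synthetic stand-in for first-frame features and a reference mask.
rng = np.random.default_rng(0)
first_frame_feats = rng.normal(size=(200, 8))
first_frame_mask = (first_frame_feats @ rng.normal(size=8) > 0).astype(float)

w, losses = fit_target_model(first_frame_feats, first_frame_mask)

# Apply the fitted model to features of a (synthetic) later frame.
later_frame_feats = rng.normal(size=(50, 8))
predicted_mask = later_frame_feats @ w > 0.5
```

In an end-to-end setting, the feature extractor that produces `feats` would be trained through these unrolled steps. "Learning what to learn" would correspond to replacing the raw mask target with a learned encoding, which this sketch does not attempt.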


research
03/13/2019

RVOS: End-to-End Recurrent Network for Video Object Segmentation

Multiple object video object segmentation is a challenging task, special...
research
02/27/2020

Learning Fast and Robust Target Models for Video Object Segmentation

Video object segmentation (VOS) is a highly challenging problem since th...
research
07/11/2020

Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching

Significant progress has been made in Video Object Segmentation (VOS), t...
research
03/30/2021

Deep Gaussian Processes for Few-Shot Segmentation

Few-shot segmentation is a challenging task, requiring the extraction of...
research
10/10/2020

Hybrid Sequence to Sequence Model for Video Object Segmentation

One-shot Video Object Segmentation (VOS) is the task of pixel-wise track...
research
07/14/2020

Video Object Segmentation with Episodic Graph Memory Networks

How to make a segmentation model to efficiently adapt to a specific vide...
research
10/07/2021

Dense Gaussian Processes for Few-Shot Segmentation

Few-shot segmentation is a challenging dense prediction task, which enta...
