Online Adaptation of Convolutional Neural Networks for Video Object Segmentation

by Paul Voigtlaender, et al.

We tackle the task of semi-supervised video object segmentation, i.e., segmenting the pixels belonging to an object in the video using the ground truth pixel mask for the first frame. We build on the recently introduced one-shot video object segmentation (OSVOS) approach, which uses a pretrained network and fine-tunes it on the first frame. While achieving impressive performance, at test time OSVOS uses the fine-tuned network in unchanged form and is not able to adapt to large changes in object appearance. To overcome this limitation, we propose Online Adaptive Video Object Segmentation (OnAVOS), which updates the network online using training examples selected based on the confidence of the network and the spatial configuration. Additionally, we add a pretraining step based on objectness, which is learned on PASCAL. Our experiments show that both extensions are highly effective and improve the state of the art on DAVIS to an intersection-over-union score of 85.7%.
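The selection of online training examples described in the abstract can be illustrated with a rough sketch. It assumes that pixels with a high predicted foreground probability become positive examples, that pixels far from the previous frame's predicted mask become negative examples, and that all remaining pixels are ignored during the update. The function name `select_online_labels`, the label constants, the threshold values, and the rule that distant pixels are treated as negative even when the network is confident are assumptions made for illustration; the abstract does not specify them. The brute-force distance computation is only meant for small inputs.

```python
from math import hypot

# Hypothetical label constants for this sketch.
POSITIVE, NEGATIVE, IGNORE = 1, 0, -1


def select_online_labels(probs, last_mask, pos_threshold=0.97, neg_distance=2.0):
    """Assign a training label to every pixel of the current frame.

    probs:     2D list of predicted foreground probabilities (current frame).
    last_mask: 2D list of 0/1 values, the mask predicted for the previous frame.
    Both thresholds are illustrative assumptions, not values from the abstract.
    """
    # Coordinates of all foreground pixels in the previous mask.
    foreground = [
        (r, c)
        for r, row in enumerate(last_mask)
        for c, value in enumerate(row)
        if value
    ]

    labels = []
    for r, row in enumerate(probs):
        label_row = []
        for c, p in enumerate(row):
            # Euclidean distance to the nearest previous foreground pixel
            # (infinite if the previous mask is empty).
            dist = min(
                (hypot(r - fr, c - fc) for fr, fc in foreground),
                default=float("inf"),
            )
            if dist > neg_distance:
                # Spatial configuration: far from the last mask.
                label_row.append(NEGATIVE)
            elif p > pos_threshold:
                # Network confidence: very likely foreground.
                label_row.append(POSITIVE)
            else:
                # Uncertain pixels contribute nothing to the update.
                label_row.append(IGNORE)
        labels.append(label_row)
    return labels


if __name__ == "__main__":
    probs = [[0.99, 0.50, 0.99, 0.99]]
    last_mask = [[1, 0, 0, 0]]
    print(select_online_labels(probs, last_mask))
```

In a full system, the resulting label map would serve as the target for a few fine-tuning steps on the current frame, with ignored pixels masked out of the loss.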



One-Shot Video Object Segmentation

This paper tackles the task of semi-supervised video object segmentation...

Make One-Shot Video Object Segmentation Efficient Again

Video object segmentation (VOS) describes the task of segmenting a set o...

Adaptive ROI Generation for Video Object Segmentation Using Reinforcement Learning

In this paper, we aim to tackle the task of semi-supervised video object...

Dynamic Video Segmentation Network

In this paper, we present a detailed design of dynamic video segmentatio...

Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation

Video Panoptic Segmentation (VPS) aims to achieve comprehensive pixel-le...

Collaborative Attention Memory Network for Video Object Segmentation

Semi-supervised video object segmentation is a fundamental yet challengi...

SwiftNet: Real-time Video Object Segmentation

In this work we present SwiftNet for real-time semi-supervised video obj...