Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation

12/10/2019
by   Gedas Bertasius, et al.
26

We introduce a method for simultaneously classifying, segmenting and tracking object instances in a video sequence. Our method, named MaskProp, adapts the popular Mask R-CNN to video by adding a mask propagation branch that propagates frame-level object instance masks from each video frame to all the other frames in a video clip. This allows our system to predict clip-level instance tracks with respect to the object instances segmented in the middle frame of the clip. Clip-level instance tracks generated densely for each frame in the sequence are finally aggregated to produce video-level object instance segmentation and classification. Our experiments demonstrate that our clip-level instance segmentation makes our approach robust to motion blur and object occlusions in video. MaskProp achieves the best reported accuracy on the YouTube-VIS dataset, outperforming the ICCV 2019 video instance segmentation challenge winner despite being much simpler and using orders of magnitude less labeled data (1.3M vs 1B images and 860K vs 14M bounding boxes)

READ FULL TEXT

page 1

page 4

page 6

page 8

research
11/30/2020

End-to-End Video Instance Segmentation with Transformers

Video instance segmentation (VIS) is the task that requires simultaneous...
research
09/30/2019

LIP: Learning Instance Propagation for Video Object Segmentation

In recent years, the task of segmenting foreground objects from backgrou...
research
12/08/2016

Learning Video Object Segmentation from Static Images

Inspired by recent advances of deep learning in instance segmentation an...
research
10/22/2021

1st Place Solution for the UVO Challenge on Video-based Open-World Segmentation 2021

In this report, we introduce our (pretty straightforard) two-step "detec...
research
06/07/2021

Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation

In this paper, we propose Contextual Guided Segmentation (CGS) framework...
research
04/22/2022

Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation

Video Instance Segmentation is a fundamental computer vision task that d...
research
09/14/2017

Learning to Segment Instances in Videos with Spatial Propagation Network

We propose a deep learning-based framework for instance-level object seg...

Please sign up or login with your details

Forgot password? Click here to reset