Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos

01/06/2021
by   Bin Zhao, et al.
6

Segmenting objects in videos is a fundamental computer vision task. The current deep learning based paradigm offers a powerful, but data-hungry solution. However, current datasets are limited by the cost and human effort of annotating object masks in videos. This effectively limits the performance and generalization capabilities of existing video segmentation methods. To address this issue, we explore weaker form of bounding box annotations. We introduce a method for generating segmentation masks from per-frame bounding box annotations in videos. To this end, we propose a spatio-temporal aggregation module that effectively mines consistencies in the object and background appearance across multiple frames. We use our resulting accurate masks for weakly supervised training of video object segmentation (VOS) networks. We generate segmentation masks for large scale tracking datasets, using only their bounding box annotations. The additional data provides substantially better generalization performance leading to state-of-the-art results in both the VOS and more challenging tracking domain.

READ FULL TEXT

page 1

page 3

page 7

page 13

research
12/15/2022

Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration

Instance segmentation in videos, which aims to segment and track multipl...
research
08/09/2021

Video Annotation for Visual Tracking via Selection and Refinement

Deep learning based visual trackers entail offline pre-training on large...
research
11/02/2020

Reducing the Annotation Effort for Video Object Segmentation Datasets

For further progress in video object segmentation (VOS), larger, more di...
research
11/19/2020

Towards Spatio-Temporal Video Scene Text Detection via Temporal Clustering

With only bounding-box annotations in the spatial domain, existing video...
research
12/12/2022

Breaking the "Object" in Video Object Segmentation

The appearance of an object can be fleeting when it transforms. As eggs ...
research
12/14/2020

Improving Panoptic Segmentation at All Scales

Crop-based training strategies decouple training resolution from GPU mem...
research
12/07/2022

BoxPolyp:Boost Generalized Polyp Segmentation Using Extra Coarse Bounding Box Annotations

Accurate polyp segmentation is of great importance for colorectal cancer...

Please sign up or login with your details

Forgot password? Click here to reset