RefineVIS: Video Instance Segmentation with Temporal Attention Refinement

06/07/2023
by   Andre Abrantes, et al.
0

We introduce a novel framework called RefineVIS for Video Instance Segmentation (VIS) that achieves good object association between frames and accurate segmentation masks by iteratively refining the representations using sequence context. RefineVIS learns two separate representations on top of an off-the-shelf frame-level image instance segmentation model: an association representation responsible for associating objects across frames and a segmentation representation that produces accurate segmentation masks. Contrastive learning is utilized to learn temporally stable association representations. A Temporal Attention Refinement (TAR) module learns discriminative segmentation representations by exploiting temporal relationships and a novel temporal contrastive denoising technique. Our method supports both online and offline inference. It achieves state-of-the-art video instance segmentation accuracy on YouTube-VIS 2019 (64.4 AP), Youtube-VIS 2021 (61.4 AP), and OVIS (46.1 AP) datasets. The visualization shows that the TAR module can generate more accurate instance segmentation masks, particularly for challenging cases such as highly occluded objects.

READ FULL TEXT

page 2

page 4

page 10

research
06/14/2022

Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention

Video instance segmentation aims at predicting object segmentation masks...
research
07/21/2022

In Defense of Online Models for Video Instance Segmentation

In recent years, video instance segmentation (VIS) has been largely adva...
research
10/14/2022

Instance Segmentation with Cross-Modal Consistency

Segmenting object instances is a key task in machine perception, with sa...
research
06/09/2022

VITA: Video Instance Segmentation via Object Token Association

We introduce a novel paradigm for offline Video Instance Segmentation (V...
research
12/20/2021

Mask2Former for Video Instance Segmentation

We find Mask2Former also achieves state-of-the-art performance on video ...
research
03/30/2023

MobileInst: Video Instance Segmentation on the Mobile

Although recent approaches aiming for video instance segmentation have a...
research
11/12/2018

Learning Segmentation Masks with the Independence Prior

An instance with a bad mask might make a composite image that uses it lo...

Please sign up or login with your details

Forgot password? Click here to reset