MAIN: Multi-Attention Instance Network for Video Segmentation

04/11/2019
by   Juan Leon Alcazar, et al.
0

Instance-level video segmentation requires a solid integration of spatial and temporal information. However, current methods rely mostly on domain-specific information (online learning) to produce accurate instance-level segmentations. We propose a novel approach that relies exclusively on the integration of generic spatio-temporal attention cues. Our strategy, named Multi-Attention Instance Network (MAIN), overcomes challenging segmentation scenarios over arbitrary videos without modelling sequence- or instance-specific knowledge. We design MAIN to segment multiple instances in a single forward pass, and optimize it with a novel loss function that favors class agnostic predictions and assigns instance-specific penalties. We achieve state-of-the-art performance on the challenging Youtube-VOS dataset and benchmark, improving the unseen Jaccard and F-Metric by 6.8 real-time (30.3 FPS).

READ FULL TEXT

page 1

page 2

page 4

page 8

page 12

page 13

page 14

research
09/21/2023

TCOVIS: Temporally Consistent Online Video Instance Segmentation

In recent years, significant progress has been made in video instance se...
research
01/20/2023

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

Most existing transformer based video instance segmentation methods extr...
research
04/22/2022

Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation

Video Instance Segmentation is a fundamental computer vision task that d...
research
04/13/2021

Crossover Learning for Fast Online Video Instance Segmentation

Modeling temporal visual context across frames is critical for video ins...
research
12/19/2019

Learning a Spatio-Temporal Embedding for Video Instance Segmentation

We present a novel embedding approach for video instance segmentation. O...
research
03/22/2023

Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation

The goal of video segmentation is to accurately segment and track every ...
research
04/10/2021

Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation

This paper addresses the task of unsupervised video multi-object segment...

Please sign up or login with your details

Forgot password? Click here to reset