Robust and Efficient Memory Network for Video Object Segmentation

04/24/2023
by   Yadang Chen, et al.

This paper proposes a Robust and Efficient Memory Network, referred to as REMN, for semi-supervised video object segmentation (VOS). Memory-based methods have recently achieved outstanding VOS performance by performing non-local pixel-wise matching between the query and memory. However, these methods have two limitations. 1) Non-local matching can cause distractor objects in the background to be incorrectly segmented. 2) Memory features with high temporal redundancy consume significant computing resources. For limitation 1, we introduce a local attention mechanism that tackles background distraction by enhancing the features of foreground objects with the previous mask. For limitation 2, we first adaptively decide whether to update the memory features, depending on the variation of foreground objects, to reduce temporal redundancy. Second, we employ a dynamic memory bank, which uses a lightweight and differentiable soft modulation gate to decide how many memory features need to be removed in the temporal dimension. Experiments demonstrate that our REMN achieves state-of-the-art results on DAVIS 2017, with a 𝒥&ℱ score of 86.3 and an overall mean score of 85.5, while running at 25+ FPS and using relatively few computing resources.
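The adaptive memory update described above can be illustrated with a minimal sketch. The abstract does not say how the variation of foreground objects is measured, so this example assumes, purely for illustration, that variation is the intersection-over-union (IoU) between the current foreground mask and the mask of the most recently stored memory frame. The function names and the threshold value of 0.8 are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

# A binary mask as rows of 0/1 values (1 = foreground pixel).
Mask = Sequence[Sequence[int]]


def foreground_iou(a: Mask, b: Mask) -> float:
    """Intersection-over-union of the foreground pixels of two masks."""
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            if pa and pb:
                inter += 1
            if pa or pb:
                union += 1
    # Two empty masks are treated as identical (no variation).
    return 1.0 if union == 0 else inter / union


def should_update_memory(stored: Mask, current: Mask,
                         iou_threshold: float = 0.8) -> bool:
    """Assumed criterion: update only when the foreground changed enough."""
    return foreground_iou(stored, current) < iou_threshold


def select_memory_frames(masks: Sequence[Mask],
                         iou_threshold: float = 0.8) -> List[int]:
    """Return indices of frames that would be written to memory."""
    if not masks:
        return []
    kept = [0]  # the first (annotated) frame is always stored
    for t in range(1, len(masks)):
        if should_update_memory(masks[kept[-1]], masks[t], iou_threshold):
            kept.append(t)
    return kept


if __name__ == "__main__":
    still = [[1, 1, 0, 0], [1, 1, 0, 0]]
    moved = [[0, 1, 1, 0], [0, 1, 1, 0]]
    # Frames 1 and 3 repeat the stored foreground, so they are skipped.
    print(select_memory_frames([still, still, moved, moved]))
```

Under this assumed criterion, frames whose foreground barely changes are never written to memory, which is one simple way to reduce temporal redundancy. The soft modulation gate of the dynamic memory bank is a learned component and is not modeled here.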

research
09/23/2021

Hierarchical Memory Matching Network for Video Object Segmentation

We present Hierarchical Memory Matching Network (HMMN) for semi-supervis...
research
08/03/2022

Per-Clip Video Object Segmentation

Recently, memory-based approaches show promising results on semi-supervi...
research
07/09/2021

Fast Pixel-Matching for Video Object Segmentation

Video object segmentation, aiming to segment the foreground objects give...
research
07/16/2020

Kernelized Memory Network for Video Object Segmentation

Semi-supervised video object segmentation (VOS) is a task that involves ...
research
10/13/2020

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration

This paper investigates the principles of embedding learning to tackle t...
research
02/09/2021

SwiftNet: Real-time Video Object Segmentation

In this work we present SwiftNet for real-time semi-supervised video obj...
research
02/15/2021

VA-RED^2: Video Adaptive Redundancy Reduction

Performing inference on deep learning models for videos remains a challe...
