SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

08/22/2022
by   Zhihui Lin, et al.
6

Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS). However, continuously growing and redundant template features lead to an inefficient inference. To alleviate this, we propose a novel Sequential Weighted Expectation-Maximization (SWEM) network to greatly reduce the redundancy of memory features. Different from the previous methods which only detect feature redundancy between frames, SWEM merges both intra-frame and inter-frame similar features by leveraging the sequential weighted EM algorithm. Further, adaptive weights for frame features endow SWEM with the flexibility to represent hard samples, improving the discrimination of templates. Besides, the proposed method maintains a fixed number of template features in memory, which ensures the stable inference complexity of the VOS system. Extensive experiments on commonly used DAVIS and YouTube-VOS datasets verify the high efficiency (36 FPS) and high performance (84.3% 𝒥&ℱ on DAVIS 2017 validation dataset) of SWEM. Code is available at: https://github.com/lmm077/SWEM.

READ FULL TEXT

page 3

page 8

page 15

research
07/21/2022

Region Aware Video Object Segmentation with Deep Motion Modeling

Current semi-supervised video object segmentation (VOS) methods usually ...
research
09/18/2020

PMVOS: Pixel-Level Matching-Based Video Object Segmentation

Semi-supervised video object segmentation (VOS) aims to segment arbitrar...
research
05/07/2021

Adaptive Focus for Efficient Video Recognition

In this paper, we explore the spatial redundancy in video recognition wi...
research
02/09/2021

SwiftNet: Real-time Video Object Segmentation

In this work we present SwiftNet for real-time semi-supervised video obj...
research
01/30/2020

Fast Video Object Segmentation using the Global Context Module

We developed a real-time, high-quality video object segmentation algorit...
research
03/27/2023

EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision

We introduce Equivariant Neural Field Expectation Maximization (EFEM), a...
research
11/02/2021

Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic Perspective

Modern video object segmentation (VOS) algorithms have achieved remarkab...

Please sign up or login with your details

Forgot password? Click here to reset