Two-Stream Networks for Object Segmentation in Videos

08/08/2022
by Hannan Lu, et al.

Existing matching-based approaches perform video object segmentation (VOS) by retrieving support features from a pixel-level memory. However, some pixels have no correspondence in the memory (i.e., they are unseen), which inevitably limits segmentation performance. In this paper, we present a Two-Stream Network (TSN). Our TSN includes (i) a pixel stream with a conventional pixel-level memory, which segments the seen pixels based on their pixel-level memory retrieval; (ii) an instance stream for the unseen pixels, where a holistic understanding of the instance is obtained with dynamic segmentation heads conditioned on the features of the target instance; and (iii) a pixel division module that generates a routing map, with which the output embeddings of the two streams are fused. The compact instance stream effectively improves the segmentation accuracy of the unseen pixels, while fusing the two streams with the adaptive routing map leads to an overall performance boost. Through extensive experiments, we demonstrate the effectiveness of our proposed TSN, and we also report state-of-the-art performance of 86.1 2018 and 87.5
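
To make the fusion step concrete, here is a minimal sketch of how a routing map could blend the two streams' output embeddings. The abstract only states that the embeddings are fused with a routing map; the per-pixel convex combination below, the convention that a weight of 1 means "seen" (trust the pixel stream), and the name `fuse_streams` are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List

Embedding = List[float]  # one embedding vector per pixel (illustrative type)


def fuse_streams(
    pixel_emb: List[Embedding],
    instance_emb: List[Embedding],
    routing: List[float],
) -> List[Embedding]:
    """Blend two per-pixel embeddings with a routing weight in [0, 1].

    Assumption: routing[i] = 1 keeps the pixel-stream embedding (seen pixel),
    routing[i] = 0 keeps the instance-stream embedding (unseen pixel).
    """
    if not (len(pixel_emb) == len(instance_emb) == len(routing)):
        raise ValueError("all inputs must cover the same number of pixels")

    fused: List[Embedding] = []
    for p, q, r in zip(pixel_emb, instance_emb, routing):
        if not 0.0 <= r <= 1.0:
            raise ValueError("routing weights must lie in [0, 1]")
        if len(p) != len(q):
            raise ValueError("embeddings must have the same dimension")
        # Hypothetical fusion rule: convex combination of the two streams.
        fused.append([r * a + (1.0 - r) * b for a, b in zip(p, q)])
    return fused
```

Under these assumptions, a pixel with routing weight 1 takes its embedding entirely from the pixel stream, a pixel with weight 0 takes it entirely from the instance stream, and intermediate weights interpolate between the two.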

