Holistic Prototype Attention Network for Few-Shot VOS

07/16/2023
by   Yin Tang, et al.
0

Few-shot video object segmentation (FSVOS) aims to segment dynamic objects of unseen classes by resorting to a small set of support images that contain pixel-level object annotations. Existing methods have demonstrated that the domain agent-based attention mechanism is effective in FSVOS by learning the correlation between support images and query frames. However, the agent frame contains redundant pixel information and background noise, resulting in inferior segmentation performance. Moreover, existing methods tend to ignore inter-frame correlations in query videos. To alleviate the above dilemma, we propose a holistic prototype attention network (HPAN) for advancing FSVOS. Specifically, HPAN introduces a prototype graph attention module (PGAM) and a bidirectional prototype attention module (BPAM), transferring informative knowledge from seen to unseen classes. PGAM generates local prototypes from all foreground features and then utilizes their internal correlations to enhance the representation of the holistic prototypes. BPAM exploits the holistic information from support images and video frames by fusing co-attention and self-attention to achieve support-query semantic consistency and inner-frame temporal consistency. Extensive experiments on YouTube-FSVOS have been provided to demonstrate the effectiveness and superiority of our proposed HPAN method.

READ FULL TEXT

page 1

page 3

page 8

page 9

page 10

research
06/20/2022

MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot Segmentation

Few-shot segmentation aims to segment unseen-class objects given only a ...
research
09/19/2019

A New Few-shot Segmentation Network Based on Class Representation

This paper studies few-shot segmentation, which is a task of predicting ...
research
10/17/2019

Cross Attention Network for Few-shot Classification

Few-shot classification aims to recognize unlabeled samples from unseen ...
research
08/08/2022

Two-Stream Networks for Object Segmentation in Videos

Existing matching-based approaches perform video object segmentation (VO...
research
06/27/2023

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract

Few-shot semantic segmentation (FSS) aims to form class-agnostic models ...
research
03/26/2023

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation

Few-shot semantic segmentation (FSS) aims to form class-agnostic models ...
research
09/21/2023

Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation

Unsupervised Video Object Segmentation (VOS) aims at identifying the con...

Please sign up or login with your details

Forgot password? Click here to reset