Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation

07/15/2023
by   Mennatullah Siam, et al.
0

Few-shot video segmentation is the task of delineating a specific novel class in a query video using few labelled support images. Typical approaches compare support and query features while limiting comparisons to a single feature layer and thereby ignore potentially valuable information. We present a meta-learned Multiscale Memory Comparator (MMC) for few-shot video segmentation that combines information across scales within a transformer decoder. Typical multiscale transformer decoders for segmentation tasks learn a compressed representation, their queries, through information exchange across scales. Unlike previous work, we instead preserve the detailed feature maps during across scale information exchange via a multiscale memory transformer decoding to reduce confusion between the background and novel class. Integral to the approach, we investigate multiple forms of information exchange across scales in different tasks and provide insights with empirical evidence on which to use in each task. The overall comparisons among query and support features benefit from both rich semantics and precise localization. We demonstrate our approach primarily on few-shot video object segmentation and an adapted version on the fully supervised counterpart. In all cases, our approach outperforms the baseline and yields state-of-the-art performance. Our code is publicly available at https://github.com/MSiam/MMC-MultiscaleMemory.

READ FULL TEXT

page 1

page 3

page 8

page 15

page 16

research
04/12/2023

MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation

Multiscale video transformers have been explored in a wide variety of vi...
research
06/01/2021

Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes

Few-shot segmentation (FSS) performance has been extensively promoted by...
research
03/18/2022

Local-Global Context Aware Transformer for Language-Guided Video Segmentation

We explore the task of language-guided video segmentation (LVS). Previou...
research
07/17/2023

Hierarchical Spatiotemporal Transformers for Video Object Segmentation

This paper presents a novel framework called HST for semi-supervised vid...
research
10/13/2022

Feature-Proxy Transformer for Few-Shot Segmentation

Few-shot segmentation (FSS) aims at performing semantic segmentation on ...
research
09/28/2019

Feature Weighting and Boosting for Few-Shot Segmentation

This paper is about few-shot segmentation of foreground objects in image...
research
12/09/2022

MSI: Maximize Support-Set Information for Few-Shot Segmentation

FSS(Few-shot segmentation) aims to segment a target class with a small n...

Please sign up or login with your details

Forgot password? Click here to reset