Less than Few: Self-Shot Video Instance Segmentation

04/19/2022
by   Pengwan Yang, et al.
8

The goal of this paper is to bypass the need for labelled examples in few-shot video understanding at run time. While proven effective, in many practical video settings even labelling a few examples appears unrealistic. This is especially true as the level of details in spatio-temporal video understanding and with it, the complexity of annotations continues to increase. Rather than performing few-shot learning with a human oracle to provide a few densely labelled support videos, we propose to automatically learn to find appropriate support videos given a query. We call this self-shot learning and we outline a simple self-supervised learning method to generate an embedding space well-suited for unsupervised retrieval of relevant samples. To showcase this novel setting, we tackle, for the first time, video instance segmentation in a self-shot (and few-shot) setting, where the goal is to segment instances at the pixel-level across the spatial and temporal domains. We provide strong baseline performances that utilize a novel transformer-based model and show that self-shot learning can even surpass few-shot and can be positively combined for further performance gains. Experiments on new benchmarks show that our approach achieves strong performance, is competitive to oracle support in some settings, scales to large unlabelled video collections, and can be combined in a semi-supervised setting.

READ FULL TEXT

page 13

page 22

research
10/23/2021

MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation

Just like other few-shot learning problems, few-shot segmentation aims t...
research
01/12/2020

Few-shot Action Recognition via Improved Attention with Self-supervision

Most existing few-shot learning methods in computer vision focus on clas...
research
12/19/2019

Learning a Spatio-Temporal Embedding for Video Instance Segmentation

We present a novel embedding approach for video instance segmentation. O...
research
01/26/2020

Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Inputs

Significant progress has been made recently in developing few-shot objec...
research
06/21/2021

TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification

Recently, few-shot learning has received increasing interest. Existing e...
research
05/27/2023

Instance-based Max-margin for Practical Few-shot Recognition

In order to mimic the human few-shot learning (FSL) ability better and t...
research
02/01/2018

Annotation-Free and One-Shot Learning for Instance Segmentation of Homogeneous Object Clusters

We propose a novel approach for instance segmen- tation given an image o...

Please sign up or login with your details

Forgot password? Click here to reset