TrickVOS: A Bag of Tricks for Video Object Segmentation

06/27/2023
by   Evangelos Skartados, et al.
0

Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects where we can improve such methods; i) supervisory signal, ii) pretraining and iii) spatial awareness. We then propose TrickVOS; a generic, method-agnostic bag of tricks addressing each aspect with i) a structure-aware hybrid loss, ii) a simple decoder pretraining regime and iii) a cheap tracker that imposes spatial constraints in model predictions. Finally, we propose a lightweight network and show that when trained with TrickVOS, it achieves competitive results to state-of-the-art methods on DAVIS and YouTube benchmarks, while being one of the first STM-based SVOS methods that can run in real-time on a mobile device.

READ FULL TEXT

page 2

page 4

research
09/30/2019

Towards Good Practices for Video Object Segmentation

Semi-supervised video object segmentation is an interesting yet challeng...
research
03/14/2023

MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation

This paper tackles the problem of semi-supervised video object segmentat...
research
09/14/2021

Space Time Recurrent Memory Network

We propose a novel visual memory network architecture for the learning a...
research
06/22/2020

Self-supervised Video Object Segmentation

The objective of this paper is self-supervised representation learning, ...
research
02/10/2020

CRVOS: Clue Refining Network for Video Object Segmentation

The encoder-decoder based methods for semi-supervised video object segme...
research
05/06/2022

Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime

Semi-supervised learning (SSL) addresses the lack of labeled data by exp...

Please sign up or login with your details

Forgot password? Click here to reset