Temporally stable video segmentation without video annotations

10/17/2021
by   Aharon Azulay, et al.
19

Temporally consistent dense video annotations are scarce and hard to collect. In contrast, image segmentation datasets (and pre-trained models) are ubiquitous, and easier to label for any novel task. In this paper, we introduce a method to adapt still image segmentation models to video in an unsupervised manner, by using an optical flow-based consistency measure. To ensure that the inferred segmented videos appear more stable in practice, we verify that the consistency measure is well correlated with human judgement via a user study. Training a new multi-input multi-output decoder using this measure as a loss, together with a technique for refining current image segmentation datasets and a temporal weighted-guided filter, we observe stability improvements in the generated segmented videos with minimal loss of accuracy.

READ FULL TEXT

page 1

page 2

page 5

page 7

research
04/20/2021

Flow-based Video Segmentation for Human Head and Shoulders

Video segmentation for the human head and shoulders is essential in crea...
research
05/16/2022

Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion

Motion, measured via optical flow, provides a powerful cue to discover a...
research
06/04/2015

A Novel Approach Towards Clustering Based Image Segmentation

In computer vision, image segmentation is always selected as a major res...
research
11/27/2019

LucidDream: Controlled Temporally-Consistent DeepDream on Videos

In this work, we aim to propose a set of techniques to improve the contr...
research
06/08/2016

Point-wise mutual information-based video segmentation with high temporal consistency

In this paper, we tackle the problem of temporally consistent boundary d...
research
04/12/2023

MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation

Multiscale video transformers have been explored in a wide variety of vi...
research
10/12/2022

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos

Livestream videos have become a significant part of online learning, whe...

Please sign up or login with your details

Forgot password? Click here to reset