Deformable Sprites for Unsupervised Video Decomposition

04/14/2022
by Vickie Ye, et al.

We describe a method to extract persistent elements of a dynamic scene from an input video. We represent each scene element as a Deformable Sprite consisting of three components: 1) a 2D texture image for the entire video, 2) per-frame masks for the element, and 3) non-rigid deformations that map the texture image into each video frame. The resulting decomposition allows for applications such as consistent video editing. Deformable Sprites are a type of video auto-encoder model that is optimized on individual videos; they require neither training on a large dataset nor pre-trained models. Moreover, our method does not require object masks or other user input, and it discovers moving objects of a wider variety than previous work. We evaluate our approach on standard video datasets and show qualitative results on a diverse array of Internet videos. Code and video results can be found at https://deformable-sprites.github.io.
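To make the three-component representation concrete, the following is a minimal sketch of how a frame could be reassembled from a set of sprites. It is not the authors' implementation: the names `Sprite`, `render_layer`, and `reconstruct_frame` are hypothetical, images are grayscale nested lists, the deformation is stored as an integer per-pixel lookup table (nearest-neighbour sampling), and layers are blended by mask weight. The per-video optimization that the abstract mentions is not shown.

```python
from dataclasses import dataclass
from typing import List, Tuple

Image = List[List[float]]            # grayscale image, indexed [row][col]
Warp = List[List[Tuple[int, int]]]   # per frame pixel: (row, col) into the texture


@dataclass
class Sprite:
    """Hypothetical container for one scene element (an assumption, not the paper's API)."""
    texture: Image       # 1) a single 2D texture shared by the whole video
    masks: List[Image]   # 2) masks[t][y][x] in [0, 1], one mask per frame
    warps: List[Warp]    # 3) warps[t][y][x] maps a frame pixel to a texture pixel


def render_layer(sprite: Sprite, t: int) -> Image:
    """Warp the texture into frame t by nearest-neighbour lookup."""
    return [[sprite.texture[r][c] for (r, c) in row] for row in sprite.warps[t]]


def reconstruct_frame(sprites: List[Sprite], t: int) -> Image:
    """Blend the warped layers, weighting each by its mask (assumed compositing rule)."""
    layers = [render_layer(s, t) for s in sprites]
    height, width = len(layers[0]), len(layers[0][0])
    frame = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            total = sum(s.masks[t][y][x] for s in sprites)
            if total > 0:
                blended = sum(s.masks[t][y][x] * layer[y][x]
                              for s, layer in zip(sprites, layers))
                frame[y][x] = blended / total
    return frame


# Toy video: 2 frames of size 2x3, a flat background and a one-pixel moving object.
H, W = 2, 3
fg_masks = [
    [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]],  # object at column 0 in frame 0
    [[0.0, 1.0, 0.0], [0.0, 0.0, 0.0]],  # object at column 1 in frame 1
]
bg_masks = [[[1.0 - m for m in row] for row in mask] for mask in fg_masks]
identity = [[(y, x) for x in range(W)] for y in range(H)]
to_origin = [[(0, 0) for _ in range(W)] for _ in range(H)]

background = Sprite(texture=[[0.2] * W for _ in range(H)],
                    masks=bg_masks, warps=[identity, identity])
foreground = Sprite(texture=[[0.9]], masks=fg_masks, warps=[to_origin, to_origin])

frames = [reconstruct_frame([background, foreground], t) for t in range(2)]
```

Because the texture is shared across frames, editing `foreground.texture` once changes the object's appearance in every reconstructed frame, which is the property that makes consistent video editing possible.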

research
09/23/2021

Layered Neural Atlases for Consistent Video Editing

We present a method that decomposes, or "unwraps", an input video into a...
research
03/15/2018

Object Detection in Video with Spatiotemporal Sampling Networks

We propose a Spatiotemporal Sampling Network (STSN) that uses deformable...
research
03/07/2023

MOSO: Decomposing MOtion, Scene and Object for Video Prediction

Motion, scene and object are three primary visual components of a video....
research
05/26/2021

DFPN: Deformable Frame Prediction Network

Learned frame prediction is a current problem of interest in computer vi...
research
04/05/2022

Text2LIVE: Text-Driven Layered Image and Video Editing

We present a method for zero-shot, text-driven appearance manipulation i...
research
03/21/2022

CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot transfer learning

Gameplay videos contain rich information about how players interact with...
research
12/01/2014

Recovering Spatiotemporal Correspondence between Deformable Objects by Exploiting Consistent Foreground Motion in Video

Given unstructured videos of deformable objects, we automatically recove...
