Video Exploration via Video-Specific Autoencoders

03/31/2021
by   Kevin Wang, et al.
8

We present simple video-specific autoencoders that enables human-controllable video exploration. This includes a wide variety of analytic tasks such as (but not limited to) spatial and temporal super-resolution, spatial and temporal editing, object removal, video textures, average video exploration, and correspondence estimation within and across videos. Prior work has independently looked at each of these problems and proposed different formulations. In this work, we observe that a simple autoencoder trained (from scratch) on multiple frames of a specific video enables one to perform a large variety of video processing and editing tasks. Our tasks are enabled by two key observations: (1) latent codes learned by the autoencoder capture spatial and temporal properties of that video and (2) autoencoders can project out-of-sample inputs onto the video-specific manifold. For e.g. (1) interpolating latent codes enables temporal super-resolution and user-controllable video textures; (2) manifold reprojection enables spatial super-resolution, object removal, and denoising without training for any of the tasks. Importantly, a two-dimensional visualization of latent codes via principal component analysis acts as a tool for users to both visualize and intuitively control video edits. Finally, we quantitatively contrast our approach with the prior art and found that without any supervision and task-specific knowledge, our approach can perform comparably to supervised approaches specifically trained for a task.

READ FULL TEXT

page 17

page 18

page 27

page 32

page 33

page 37

page 38

page 39

research
05/11/2022

Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning

Spatial-Temporal Video Super-Resolution (ST-VSR) aims to generate super-...
research
03/25/2019

Recurrent Back-Projection Network for Video Super-Resolution

We proposed a novel architecture for the problem of video super-resoluti...
research
10/28/2021

MEGAN: Memory Enhanced Graph Attention Network for Space-Time Video Super-Resolution

Space-time video super-resolution (STVSR) aims to construct a high space...
research
08/18/2023

SimDA: Simple Diffusion Adapter for Efficient Video Generation

The recent wave of AI-generated content has witnessed the great developm...
research
07/13/2022

You Only Align Once: Bidirectional Interaction for Spatial-Temporal Video Super-Resolution

Spatial-Temporal Video Super-Resolution (ST-VSR) technology generates hi...
research
07/23/2020

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

Video super-resolution (VSR) aims to utilize multiple low-resolution fra...

Please sign up or login with your details

Forgot password? Click here to reset