Scalable Neural Video Representations with Learnable Positional Features

10/13/2022
by Subin Kim, et al.

Succinct representation of complex signals using coordinate-based neural representations (CNRs) has seen great progress, and several recent efforts focus on extending them to handle videos. Here, the main challenge is how to (a) alleviate the compute-inefficiency of training CNRs in order to (b) achieve high-quality video encoding while (c) maintaining parameter-efficiency. To meet requirements (a), (b), and (c) simultaneously, we propose neural video representations with learnable positional features (NVP), a novel CNR that introduces "learnable positional features" which effectively amortize a video as latent codes. Specifically, we first present a CNR architecture based on 2D latent keyframes that learn the common video contents across each spatio-temporal axis, which dramatically improves all three requirements. Then, we propose to utilize existing powerful image and video codecs as a compute- and memory-efficient compression procedure for the latent codes. We demonstrate the superiority of NVP on the popular UVG benchmark; compared with prior art, NVP not only trains 2 times faster (in less than 5 minutes) but also exceeds its encoding quality, improving PSNR from 34.07 to 34.57, even while using >8 times fewer parameters. We also show intriguing properties of NVP, e.g., video inpainting and video frame interpolation.
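To put the quoted PSNR figures in context, here is a minimal sketch of how PSNR is conventionally computed from mean squared error, and of what a 0.5 gain (34.07 to 34.57) implies for that error. It is not taken from the paper's code: the function names `psnr` and `mse_ratio_for_psnr_gain` are illustrative, and the pixel range `max_value=1.0` is an assumption.

```python
import math


def psnr(reference, reconstruction, max_value=1.0):
    """Peak signal-to-noise ratio (in dB) between two equal-length pixel sequences.

    Assumes pixel values lie in [0, max_value]; this range is an assumption,
    not something stated in the abstract.
    """
    if len(reference) != len(reconstruction) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - x) ** 2 for r, x in zip(reference, reconstruction)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_psnr_gain(gain_db):
    """Factor by which MSE shrinks when PSNR rises by gain_db (same peak value)."""
    return 10.0 ** (-gain_db / 10.0)


if __name__ == "__main__":
    # A uniform error of 0.1 on a [0, 1] signal gives an MSE of 0.01.
    print(psnr([0.0, 0.0], [0.1, 0.1]))
    # MSE ratio implied by the 34.07 -> 34.57 improvement quoted above.
    print(mse_ratio_for_psnr_gain(34.57 - 34.07))
```

Under this definition, a gain of 0.5 corresponds to roughly an 11% reduction in mean squared error, since 10^(-0.05) is about 0.891.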


