E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context

07/17/2022
by Zizhang Li, et al.

Recently, the image-wise implicit neural representation of videos, NeRV, has gained popularity for its promising results and fast speed compared to regular pixel-wise implicit representations. However, the redundant parameters within the network structure can cause a large model size when scaling up for desirable performance. The key reason for this phenomenon is the coupled formulation of NeRV, which outputs the spatial and temporal information of video frames directly from the frame index input. In this paper, we propose E-NeRV, which dramatically expedites NeRV by decomposing the image-wise implicit neural representation into separate spatial and temporal contexts. Under the guidance of this new formulation, our model greatly reduces the redundant model parameters while retaining the representation ability. We experimentally find that our method improves performance by a large margin with fewer parameters, resulting in more than 8× faster convergence. Code is available at https://github.com/kyleleey/E-NeRV.
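To make the idea of separate spatial and temporal contexts concrete, the following is a minimal NumPy sketch, not the E-NeRV architecture itself. It assumes a sin/cos positional encoding, a coarse coordinate grid, and fusion by simple concatenation; the function names (`positional_encoding`, `spatial_context`, `temporal_context`, `fuse`), the frequency base of 1.25, and the grid size are all illustrative choices that do not come from the abstract.

```python
import numpy as np

def positional_encoding(x, num_freqs=4, base=1.25):
    """Map values in [0, 1] to sin/cos features (base and num_freqs are assumed values)."""
    x = np.asarray(x, dtype=np.float64)[..., None]      # (..., 1)
    freqs = (base ** np.arange(num_freqs)) * np.pi      # (num_freqs,)
    angles = x * freqs                                  # (..., num_freqs)
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)

def spatial_context(height, width, num_freqs=4):
    """Frame-independent embedding of a coarse coordinate grid: (H, W, 4 * num_freqs)."""
    ys = np.linspace(0.0, 1.0, height)
    xs = np.linspace(0.0, 1.0, width)
    grid_y, grid_x = np.meshgrid(ys, xs, indexing="ij")  # each (H, W)
    return np.concatenate(
        [positional_encoding(grid_y, num_freqs),
         positional_encoding(grid_x, num_freqs)],
        axis=-1,
    )

def temporal_context(t, num_freqs=4):
    """Embedding of a normalized frame index t in [0, 1]: (2 * num_freqs,)."""
    return positional_encoding(t, num_freqs)

def fuse(spatial, temporal):
    """Hypothetical fusion: tile the temporal vector over the grid and concatenate."""
    h, w, _ = spatial.shape
    tiled = np.broadcast_to(temporal, (h, w, temporal.shape[-1]))
    return np.concatenate([spatial, tiled], axis=-1)

# The spatial context is computed once and shared by every frame;
# only the small temporal vector changes with the frame index.
spatial = spatial_context(9, 16)
frame_features = [fuse(spatial, temporal_context(t)) for t in (0.0, 0.5, 1.0)]
print(frame_features[0].shape)
```

In this sketch the spatial features are shared across frames, which illustrates why a disentangled formulation need not regenerate all spatial information from the frame index alone. A real model would pass the fused features through learned layers, which are omitted here.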


research
10/26/2021

NeRV: Neural Representations for Videos

We propose a novel neural representation for videos (NeRV) which encodes...
research
03/20/2023

Polynomial Implicit Neural Representations For Large Diverse Datasets

Implicit neural representations (INR) have gained significant popularity...
research
01/12/2022

Neural Residual Flow Fields for Efficient Video Representations

Implicit neural representation (INR) has emerged as a powerful paradigm ...
research
03/24/2023

Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution

Event cameras sense the intensity changes asynchronously and produce eve...
research
04/28/2022

Streaming Multiscale Deep Equilibrium Models

We present StreamDEQ, a method that infers frame-wise representations on...
research
05/30/2020

Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts

Understanding sequential information is a fundamental task for artificia...
research
07/15/2022

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection

Active speaker detection (ASD) in videos with multiple speakers is a cha...
