HiNeRV: Video Compression with Hierarchical Encoding based Neural Representation

06/16/2023
by Ho Man Kwan, et al.

Learning-based video compression is currently one of the most popular research topics, offering the potential to compete with conventional standard video codecs. In this context, Implicit Neural Representations (INRs) have previously been used to represent and compress image and video content, demonstrating relatively high decoding speed compared to other methods. However, existing INR-based methods have failed to deliver rate-quality performance comparable with the state of the art in video compression. This is mainly due to the simplicity of the employed network architectures, which limits their representation capability. In this paper, we propose HiNeRV, an INR that combines bilinear interpolation with a novel hierarchical positional encoding. This structure employs depth-wise convolutional and MLP layers to build a deep and wide network architecture with much higher capacity. We further build a video codec based on HiNeRV, together with a refined pipeline for training, pruning and quantization that better preserves HiNeRV's performance during lossy model compression. The proposed method has been evaluated on both the UVG and MCL-JCV datasets for video compression, demonstrating significant improvement over all existing INR baselines and competitive performance when compared to learning-based codecs (72.3 DCVC on the UVG dataset, measured in PSNR).
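
The abstract names bilinear interpolation as one ingredient of HiNeRV but does not say how it is applied. As a rough illustration of the general technique only, and not of the authors' implementation, the following pure-Python sketch upsamples a small 2D grid of values by blending the four nearest grid points. The function names `bilinear_sample` and `upsample`, the corner-aligned coordinate mapping, and the border clamping are all assumptions made for this example.

```python
def bilinear_sample(grid, y, x):
    """Sample a 2D grid (list of equal-length rows) at fractional (y, x)."""
    h, w = len(grid), len(grid[0])
    # Assumption: coordinates outside the grid are clamped to the border.
    y = min(max(y, 0.0), h - 1)
    x = min(max(x, 0.0), w - 1)
    y0, x0 = int(y), int(x)
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    wy, wx = y - y0, x - x0
    # Interpolate along x on the two neighbouring rows, then along y.
    top = grid[y0][x0] * (1 - wx) + grid[y0][x1] * wx
    bottom = grid[y1][x0] * (1 - wx) + grid[y1][x1] * wx
    return top * (1 - wy) + bottom * wy


def upsample(grid, scale):
    """Upsample a 2D grid by an integer factor with corner-aligned sampling."""
    h, w = len(grid), len(grid[0])
    out_h, out_w = h * scale, w * scale
    out = []
    for i in range(out_h):
        # Map the output row index to an input coordinate so corners coincide.
        y = i * (h - 1) / (out_h - 1) if out_h > 1 else 0.0
        row = []
        for j in range(out_w):
            x = j * (w - 1) / (out_w - 1) if out_w > 1 else 0.0
            row.append(bilinear_sample(grid, y, x))
        out.append(row)
    return out


if __name__ == "__main__":
    coarse = [[0.0, 1.0],
              [2.0, 3.0]]
    for row in upsample(coarse, 2):
        print(["%.2f" % v for v in row])
```

In a network, the same operation would typically be applied per channel to feature maps rather than to a single scalar grid; a library routine would normally replace these loops.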

