End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression

12/17/2021
by   M. Akin Yilmaz, et al.

Conventional video compression (VC) methods are based on motion-compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned VC allows end-to-end rate-distortion (R-D) optimized training of the nonlinear transform, motion, and entropy models simultaneously. Most works on learned VC consider end-to-end optimization of a sequential video codec based on an R-D loss averaged over pairs of successive frames. It is well known in conventional VC that hierarchical, bi-directional coding outperforms sequential compression because of its ability to use both past and future reference frames. This paper proposes a learned hierarchical bi-directional video codec (LHBDC) that combines the benefits of hierarchical motion-compensated prediction and end-to-end optimization. Experimental results show that we achieve the best R-D results reported for learned VC schemes to date in both PSNR and MS-SSIM. Compared to conventional video codecs, our end-to-end optimized codec outperforms both the x265 and SVT-HEVC encoders ("veryslow" preset) in PSNR and MS-SSIM, and the HM 16.23 reference software in MS-SSIM. We present ablation studies showing the performance gains due to the proposed novel tools, such as learned masking, flow-field subsampling, and temporal flow vector prediction. The models and instructions to reproduce our results can be found at https://github.com/makinyilmaz/LHBDC/
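
To illustrate why hierarchical bi-directional coding can draw on both past and future reference frames, the following is a minimal sketch of a generic dyadic coding order inside one group of pictures (GOP). It is not taken from the LHBDC code: the function name `hierarchical_order`, the GOP length of 8, and the assumption that both end frames are already coded as references are illustrative choices, and the abstract does not state the codec's actual GOP structure.

```python
def hierarchical_order(first, last):
    """Return (frame, past_ref, future_ref) triples in coding order.

    Assumes frames `first` and `last` are already coded (for example as
    intra frames). Each remaining frame is predicted from the nearest
    coded frame on either side, starting with the temporal midpoint.
    """
    order = []

    def split(lo, hi):
        # No frame lies strictly between lo and hi: nothing left to code.
        if hi - lo < 2:
            return
        mid = (lo + hi) // 2
        # The midpoint uses lo as past reference and hi as future reference.
        order.append((mid, lo, hi))
        # The midpoint is now available as a reference for both halves.
        split(lo, mid)
        split(mid, hi)

    split(first, last)
    return order


if __name__ == "__main__":
    # Hypothetical GOP with frames 0..8, where 0 and 8 are the references.
    for frame, past, future in hierarchical_order(0, 8):
        print(f"code frame {frame} from past={past}, future={future}")
```

In this sketch every frame between the two end references has one past and one future reference, whereas a sequential codec would predict each frame only from frames that precede it.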


research
08/11/2020

End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression

Conventional video compression methods employ a linear transform and blo...
research
11/13/2022

Advancing Learned Video Compression with In-loop Frame Prediction

Recent years have witnessed an increasing interest in end-to-end learned...
research
02/09/2022

AIVC: Artificial Intelligence based Video Codec

This paper introduces AIVC, an end-to-end neural video codec. It is base...
research
04/29/2021

ELF-VC: Efficient Learned Flexible-Rate Video Coding

While learned video codecs have demonstrated great promise, they have ye...
research
06/29/2020

OpenDVC: An Open Source Implementation of the DVC Video Compression Method

We introduce an open source Tensorflow implementation of the Deep Video ...
research
12/30/2019

A Hybrid Architecture of Jointly Learning Image Compression and Quality Enhancement with Improved Entropy Minimization

Recently, learned image compression methods have been actively studied. ...
research
07/06/2020

Nonlinear Transform Coding

We review a class of methods that can be collected under the name nonlin...
