Hierarchical B-frame Video Coding Using Two-Layer CANF without Motion Coding

04/05/2023
by   David Alexandre, et al.
0

Typical video compression systems consist of two main modules: motion coding and residual coding. This general architecture is adopted by classical coding schemes (such as international standards H.265 and H.266) and deep learning-based coding schemes. We propose a novel B-frame coding architecture based on two-layer Conditional Augmented Normalization Flows (CANF). It has the striking feature of not transmitting any motion information. Our proposed idea of video compression without motion coding offers a new direction for learned video coding. Our base layer is a low-resolution image compressor that replaces the full-resolution motion compressor. The low-resolution coded image is merged with the warped high-resolution images to generate a high-quality image as a conditioning signal for the enhancement-layer image coding in full resolution. One advantage of this architecture is significantly reduced computational complexity due to eliminating the motion information compressor. In addition, we adopt a skip-mode coding technique to reduce the transmitted latent samples. The rate-distortion performance of our scheme is slightly lower than that of the state-of-the-art learned B-frame coding scheme, B-CANF, but outperforms other learned B-frame coding schemes. However, compared to B-CANF, our scheme saves 45 for decoding. The code is available at https://nycu-clab.github.io.

READ FULL TEXT
research
12/14/2020

Learned Video Codec with Enriched Reconstruction for CLIC P-frame Coding

This paper proposes a learning-based video codec, specifically used for ...
research
09/13/2020

Improving Deep Video Compression by Resolution-adaptive Flow Coding

In the learning based video compression approaches, it is an essential i...
research
08/30/2023

Neural Video Compression with Temporal Layer-Adaptive Hierarchical B-frame Coding

Neural video compression (NVC) is a rapidly evolving video coding resear...
research
02/13/2023

Dual-layer Image Compression via Adaptive Downsampling and Spatially Varying Upconversion

Ultra high resolution (UHR) images are almost always downsampled to fit ...
research
10/14/2022

On Benefits and Challenges of Conditional Interframe Video Coding in Light of Information Theory

The rise of variational autoencoders for image and video compression has...
research
07/01/2022

Ray-Space Motion Compensation for Lenslet Plenoptic Video Coding

Plenoptic images and videos bearing rich information demand a tremendous...
research
02/27/2021

Transform Network Architectures for Deep Learning based End-to-End Image/Video Coding in Subsampled Color Spaces

Most of the existing deep learning based end-to-end image/video coding (...

Please sign up or login with your details

Forgot password? Click here to reset