Neural Video Compression with Temporal Layer-Adaptive Hierarchical B-frame Coding

08/30/2023
by   Yeongwoong Kim, et al.
0

Neural video compression (NVC) is a rapidly evolving video coding research area, with some models achieving superior coding efficiency compared to the latest video coding standard Versatile Video Coding (VVC). In conventional video coding standards, the hierarchical B-frame coding, which utilizes a bidirectional prediction structure for higher compression, had been well-studied and exploited. In NVC, however, limited research has investigated the hierarchical B scheme. In this paper, we propose an NVC model exploiting hierarchical B-frame coding with temporal layer-adaptive optimization. We first extend an existing unidirectional NVC model to a bidirectional model, which achieves -21.13 this model faces challenges when applied to sequences with complex or large motions, leading to performance degradation. To address this, we introduce temporal layer-adaptive optimization, incorporating methods such as temporal layer-adaptive quality scaling (TAQS) and temporal layer-adaptive latent scaling (TALS). The final model with the proposed methods achieves an impressive BD-rate gain of -39.86 challenges in sequences with large or complex motions with up to -49.13 BD-rate gains than the simple bidirectional extension. This improvement is attributed to the allocation of more bits to lower temporal layers, thereby enhancing overall reconstruction quality with smaller bits. Since our method has little dependency on a specific NVC model architecture, it can serve as a general tool for extending unidirectional NVC models to the ones with hierarchical B-frame coding.

READ FULL TEXT

page 5

page 9

research
04/05/2023

Hierarchical B-frame Video Coding Using Two-Layer CANF without Motion Coding

Typical video compression systems consist of two main modules: motion co...
research
07/09/2023

Predictive Coding For Animation-Based Video Compression

We address the problem of efficiently compressing video for conferencing...
research
12/29/2022

Learned Hierarchical B-frame Coding with Adaptive Feature Modulation for YUV 4:2:0 Content

This paper introduces a learned hierarchical B-frame coding scheme in re...
research
03/08/2023

Scene Matters: Model-based Deep Video Compression

Video compression has always been a popular research area, where many tr...
research
05/13/2022

Swift: Adaptive Video Streaming with Layered Neural Codecs

Layered video coding compresses video segments into layers (additional ...
research
09/11/2023

CANF-VC++: Enhancing Conditional Augmented Normalizing Flows for Video Compression with Advanced Techniques

Video has become the predominant medium for information dissemination, d...
research
05/20/2021

Convolutional Block Design for Learned Fractional Downsampling

The layers of convolutional neural networks (CNNs) can be used to alter ...

Please sign up or login with your details

Forgot password? Click here to reset