Reducing Complexity of HEVC: A Deep Learning Approach

09/19/2017
by   Mai Xu, et al.
0

High Efficiency Video Coding (HEVC) significantly reduces bit-rates over the proceeding H.264 standard but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of coding unit (CU) consumes a large proportion of the HEVC encoding complexity, due to the bruteforce search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intraand inter-modes, which is based on convolutional neural network (CNN) and long- and short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for HEVC intra- and inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit (CTU) in the form of a hierarchical CU partition map (HCPM). Then, we propose an early-terminated hierarchical CNN (ETH-CNN) for learning to predict the HCPM. Consequently, the encoding complexity of intra-mode HEVC can be drastically reduced by replacing the brute-force search with ETH-CNN to decide the CU partition. Third, an early-terminated hierarchical LSTM (ETH-LSTM) is proposed to learn the temporal correlation of the CU partition. Then, we combine ETH-LSTM and ETH-CNN to predict the CU partition for reducing the HEVC complexity for intermode. Finally, experimental results show that our approach outperforms other state-of-the-art approaches in reducing the HEVC complexity at both intra- and inter-modes.

READ FULL TEXT

page 1

page 6

page 7

page 13

research
06/23/2020

DeepQTMT: A Deep Learning Approach for Fast QTMT-based CU Partition of Intra-mode VVC

The latest standard Versatile Video Coding (VVC) significantly improves ...
research
09/23/2018

Accelerate CU Partition in HEVC using Large-Scale Convolutional Neural Network

High efficiency video coding (HEVC) suffers high encoding computational ...
research
06/15/2019

Speeding up VP9 Intra Encoder with Hierarchical Deep Learning Based Partition Prediction

In VP9 video codec, the sizes of blocks are decided during encoding by r...
research
01/14/2022

Cross-Block Difference Guided Fast CU Partition for VVC Intra Coding

In this paper, we propose a new fast CU partition algorithm for VVC intr...
research
04/06/2023

Fast QTMT Partition for VVC Intra Coding Using U-Net Framework

Versatile Video Coding (VVC) has significantly increased encoding effici...
research
05/10/2018

Enhancing HEVC Compressed Videos with a Partition-masked Convolutional Neural Network

In this paper, we propose a partition-masked Convolution Neural Network ...
research
11/16/2018

Mode Variational LSTM Robust to Unseen Modes of Variation: Application to Facial Expression Recognition

Spatio-temporal feature encoding is essential for encoding the dynamics ...

Please sign up or login with your details

Forgot password? Click here to reset