Exploring Long Short Range Temporal Information for Learned Video Compression

08/07/2022
by   Huairui Wang, et al.
0

Learned video compression methods have gained a variety of interest in the video coding community since they have matched or even exceeded the rate-distortion (RD) performance of traditional video codecs. However, many current learning-based methods are dedicated to utilizing short-range temporal information, thus limiting their performance. In this paper, we focus on exploiting the unique characteristics of video content and further exploring temporal information to enhance compression performance. Specifically, for long-range temporal information exploitation, we propose temporal prior that can update continuously within the group of pictures (GOP) during inference. In that case temporal prior contains valuable temporal information of all decoded images within the current GOP. As for short-range temporal information, we propose a progressive guided motion compensation to achieve robust and effective compensation. In detail, we design a hierarchical structure to achieve multi-scale compensation. More importantly, we use optical flow guidance to generate pixel offsets between feature maps at each scale, and the compensation results at each scale will be used to guide the following scale's compensation. Sufficient experimental results demonstrate that our method can obtain better RD performance than state-of-the-art video compression approaches. The code is publicly available on: https://github.com/Huairui/LSTVC.

READ FULL TEXT
research
07/21/2022

Mining Relations among Cross-Frame Affinities for Video Semantic Segmentation

The essence of video semantic segmentation (VSS) is how to leverage temp...
research
02/26/2023

Continuous Space-Time Video Super-Resolution Utilizing Long-Range Temporal Information

In this paper, we consider the task of space-time video super-resolution...
research
07/11/2022

Learned Video Compression via Heterogeneous Deformable Compensation Network

Learned video compression has recently emerged as an essential research ...
research
02/28/2023

Neural Video Compression with Diverse Contexts

For any video codecs, the coding efficiency highly relies on whether the...
research
03/17/2023

Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior

Video dehazing aims to recover haze-free frames with high visibility and...
research
01/17/2020

Temporal Interlacing Network

For a long time, the vision community tries to learn the spatio-temporal...
research
03/21/2021

PGT: A Progressive Method for Training Models on Long Videos

Convolutional video models have an order of magnitude larger computation...

Please sign up or login with your details

Forgot password? Click here to reset