LVVC: A Learned Versatile Video Coding Framework for Efficient Human-Machine Vision

06/19/2023
by   Xihua Sheng, et al.
0

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to human and - as usual - before being processed/analyzed by machine vision algorithms. For machine vision, it is more efficient at least conceptually, to process/analyze the coded representations directly without decoding them into pixels. Motivated by this concept, we propose a learned versatile video coding (LVVC) framework, which targets on learning compact representations to support both decoding and direct processing/analysis, thereby being versatile for both human and machine vision. Our LVVC framework has a feature-based compression loop, where one frame is encoded (resp. decoded) to intermediate features, and the intermediate features are referenced for encoding (resp. decoding) the following frames. Our proposed feature-based compression loop has two key technologies, one is feature-based temporal context mining, and the other is cross-domain motion encoder/decoder. With the LVVC framework, the intermediate features may be used to reconstruct videos, or be fed into different task networks. The LVVC framework is implemented and evaluated with video reconstruction, video processing, and video analysis tasks on the well-established benchmark datasets. The evaluation results demonstrate the compression efficiency of the proposed LVVC framework.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

page 11

page 12

research
01/09/2020

An Emerging Coding Paradigm VCM: A Scalable Coding Approach Beyond Feature and Signal

In this paper, we study a new problem arising from the emerging MPEG sta...
research
01/10/2020

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

Video coding, which targets to compress and reconstruct the whole frame,...
research
01/09/2020

Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

The past decades have witnessed the rapid development of image and video...
research
05/21/2020

Complexity Analysis Of Next-Generation VVC Encoding and Decoding

While the next generation video compression standard, Versatile Video Co...
research
09/10/2020

Key-Point Sequence Lossless Compression for Intelligent Video Analysis

Feature coding has been recently considered to facilitate intelligent vi...
research
04/29/2021

Automatic Generation of H.264 Parameter Sets to Recover Video File Fragments

We address the problem of decoding video file fragments when the necessa...
research
11/27/2021

Temporal Context Mining for Learned Video Compression

We address end-to-end learned video compression with a special focus on ...

Please sign up or login with your details

Forgot password? Click here to reset