Learned Scalable Video Coding For Humans and Machines

07/18/2023
by   Hadi Hadizadeh, et al.
0

Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep neural networks (DNNs), encoded video is increasingly being used for automatic video analytics performed by machines. In applications such as automatic traffic monitoring, analytics such as vehicle detection, tracking and counting, would run continuously, while human viewing could be required occasionally to review potential incidents. To support such applications, a new paradigm for video coding is needed that will facilitate efficient representation and compression of video for both machine and human use in a scalable manner. In this manuscript, we introduce the first end-to-end learnable video codec that supports a machine vision task in its base layer, while its enhancement layer supports input reconstruction for human viewing. The proposed system is constructed based on the concept of conditional coding to achieve better compression gains. Comprehensive experimental evaluations conducted on four standard video datasets demonstrate that our framework outperforms both state-of-the-art learned and conventional video codecs in its base layer, while maintaining comparable performance on the human vision task in its enhancement layer. We will provide the implementation of the proposed system at www.github.com upon completion of the review process.

READ FULL TEXT

page 1

page 4

page 6

page 8

page 10

page 13

research
08/04/2022

Scalable Video Coding for Humans and Machines

Video content is watched not only by humans, but increasingly also by ma...
research
07/05/2023

Base Layer Efficiency in Scalable Human-Machine Coding

A basic premise in scalable human-machine coding is that the base layer ...
research
05/17/2023

VVC+M: Plug and Play Scalable Image Coding for Humans and Machines

Compression for machines is an emerging field, where inputs are encoded ...
research
01/10/2020

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

Video coding, which targets to compress and reconstruct the whole frame,...
research
03/14/2019

Scalable Facial Image Compression with Deep Feature Reconstruction

In this paper, we propose a scalable image compression scheme, including...
research
08/13/2020

Towards Modality Transferable Visual Information Representation with Optimal Model Compression

Compactly representing the visual signals is of fundamental importance i...
research
04/21/2020

Towards Analysis-friendly Face Representation with Scalable Feature and Texture Compression

It plays a fundamental role to compactly represent the visual informatio...

Please sign up or login with your details

Forgot password? Click here to reset