Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

01/09/2020
by Yueyu Hu, et al.

The past decades have witnessed the rapid development of image and video coding techniques in the era of big data. However, because existing image/video coding pipelines are designed around signal fidelity, they are limited in their ability to serve both machine and human vision. In this paper, we propose a novel image coding framework that leverages both compressive and generative models to support machine vision and human perception tasks jointly. Given an input image, feature analysis is applied first, and a generative model then reconstructs the image from the features and additional reference pixels. In this work, compact edge maps are extracted as the features, connecting both kinds of vision in a scalable way. The compact edge map serves as the base layer for machine vision tasks, and the reference pixels act as an enhancement layer that guarantees signal fidelity for human vision. By introducing advanced generative models, we train a flexible network to reconstruct images from the compact feature representations and the reference pixels. Experimental results demonstrate the superiority of our framework in both human visual quality and facial landmark detection, providing useful evidence for the emerging MPEG VCM (Video Coding for Machines) standardization effort.
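
The two-layer structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the gradient-threshold edge detector, the regular-grid pixel sampling, and every function name below are assumptions chosen for illustration, and the generative decoder that reconstructs the image is omitted entirely. The sketch only shows how a base layer (edge map) can be sent alone for machine vision, with an optional enhancement layer (reference pixels) added for human viewing.

```python
import numpy as np

def extract_edge_map(image, threshold=0.2):
    """Base layer: binary edge map from gradient magnitude.
    Assumption: a simple stand-in for the paper's edge extraction."""
    gy, gx = np.gradient(image.astype(np.float64))
    return np.hypot(gx, gy) > threshold

def sample_reference_pixels(image, stride=8):
    """Enhancement layer: sparse pixels on a regular grid.
    Assumption: hypothetical sampling pattern, not taken from the paper."""
    return image[::stride, ::stride].copy()

def encode(image, with_enhancement=True):
    """Pack the edge map into bits; optionally attach reference pixels."""
    layers = {
        "shape": image.shape,
        "base": np.packbits(extract_edge_map(image)),
    }
    if with_enhancement:
        layers["enhancement"] = sample_reference_pixels(image)
    return layers

def decode_edge_map(layers):
    """Recover the binary edge map from the base layer alone."""
    h, w = layers["shape"]
    bits = np.unpackbits(layers["base"])[: h * w]
    return bits.reshape(h, w).astype(bool)

# Toy grayscale image: dark left half, bright right half.
image = np.zeros((16, 16))
image[:, 8:] = 1.0

machine_stream = encode(image, with_enhancement=False)  # base layer only
human_stream = encode(image)                            # base + enhancement
edges = decode_edge_map(machine_stream)
```

A machine vision task such as landmark detection would consume only `edges`, while a human-facing decoder would additionally receive `human_stream["enhancement"]` and pass both to a generative model.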

research
01/09/2020

An Emerging Coding Paradigm VCM: A Scalable Coding Approach Beyond Feature and Signal

In this paper, we study a new problem arising from the emerging MPEG sta...
research
02/02/2021

Human-Machine Collaborative Video Coding Through Cuboidal Partitioning

Video coding algorithms encode and decode an entire video frame while fe...
research
10/18/2021

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics

Video Coding for Machines (VCM) is committed to bridging to an extent se...
research
06/19/2023

LVVC: A Learned Versatile Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before ...
research
05/17/2023

VVC+M: Plug and Play Scalable Image Coding for Humans and Machines

Compression for machines is an emerging field, where inputs are encoded ...
research
07/05/2023

Base Layer Efficiency in Scalable Human-Machine Coding

A basic premise in scalable human-machine coding is that the base layer ...
research
11/10/2020

Conceptual Compression via Deep Structure and Texture Synthesis

Existing compression methods typically focus on the removal of signal-le...
