Visual Analysis Motivated Rate-Distortion Model for Image Coding

04/21/2021
by Zhimeng Huang, et al.

Because existing image codecs are optimized for pixel-fidelity metrics, the images they compress face systematic challenges when used for visual analysis tasks, especially at low bitrates. This paper proposes a visual analysis-motivated rate-distortion model for Versatile Video Coding (VVC) intra compression. The proposed model makes two major contributions: a novel rate allocation strategy and a new distortion measurement model. We first propose the region of interest for machine (ROIM) to evaluate how important each coding tree unit (CTU) is to visual analysis. Then, a novel CTU-level bit allocation model is proposed based on ROIM and the local texture characteristics of each CTU. After an in-depth analysis of multiple distortion models, a visual analysis-friendly distortion criterion is subsequently proposed by extracting the deep features of each coding unit (CU). To alleviate the lack of spatial context information when calculating the distortion of each CU, we finally propose a multi-scale feature distortion (MSFD) metric, which draws on neighboring pixels at different scales and weights the deep features extracted at each scale. Extensive experimental results show that the proposed scheme achieves up to 28.17% bitrate savings at the same analysis performance across several typical visual analysis tasks, such as image classification, object detection, and semantic segmentation.
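The abstract does not give the formulas behind the bit allocation model or the MSFD metric, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that a CTU's share of the bit budget grows with a ROIM importance score and with a texture measure (here, pixel variance), and that MSFD is a weighted sum of per-scale mean squared feature differences. The function names, the exponents `alpha` and `beta`, and the use of variance as the texture measure are all hypothetical.

```python
from statistics import pvariance


def block_texture(pixels):
    """Texture measure for one CTU: population variance of its pixel values.
    (Assumption: the abstract does not say how texture is measured.)"""
    return pvariance(pixels)


def allocate_ctu_bits(total_bits, roim_scores, textures,
                      alpha=1.0, beta=0.5, eps=1e-6):
    """Split total_bits across CTUs in proportion to roim^alpha * texture^beta.
    alpha, beta and eps are illustrative parameters, not values from the paper."""
    if len(roim_scores) != len(textures):
        raise ValueError("expected one ROIM score and one texture value per CTU")
    weights = [(r + eps) ** alpha * (t + eps) ** beta
               for r, t in zip(roim_scores, textures)]
    total_weight = sum(weights)
    return [total_bits * w / total_weight for w in weights]


def multi_scale_feature_distortion(ref_feats, rec_feats, scale_weights):
    """Weighted sum over scales of the mean squared difference between the
    reference and reconstructed feature vectors at that scale."""
    if not (len(ref_feats) == len(rec_feats) == len(scale_weights)):
        raise ValueError("expected one feature vector pair and one weight per scale")
    total = 0.0
    for ref, rec, weight in zip(ref_feats, rec_feats, scale_weights):
        mse = sum((a - b) ** 2 for a, b in zip(ref, rec)) / len(ref)
        total += weight * mse
    return total


if __name__ == "__main__":
    # Two CTUs with equal texture; the first is more important for analysis.
    textures = [block_texture([10, 14, 10, 14]), block_texture([10, 14, 10, 14])]
    print(allocate_ctu_bits(1000, [0.9, 0.1], textures))
    # Two scales, equally weighted, with toy feature vectors.
    print(multi_scale_feature_distortion([[1, 2], [0, 0]],
                                         [[1, 4], [0, 2]],
                                         [0.5, 0.5]))
```

In a real encoder the feature vectors would come from a pretrained network applied to the CU and its enlarged neighborhoods; they are passed in as plain lists here so the sketch stays self-contained.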


