Object-Based Image Coding: A Learning-Driven Revisit

03/18/2020
by   Qi Xia, et al.
0

The Object-Based Image Coding (OBIC) that was extensively studied about two decades ago, promised a vast application perspective for both ultra-low bitrate communication and high-level semantical content understanding, but it had rarely been used due to the inefficient compact representation of object with arbitrary shape. A fundamental issue behind is how to efficiently process the arbitrary-shaped objects at a fine granularity (e.g., feature element or pixel wise). To attack this, we have proposed to apply the element-wise masking and compression by devising an object segmentation network for image layer decomposition, and parallel convolution-based neural image compression networks to process masked foreground objects and background scene separately. All components are optimized in an end-to-end learning framework to intelligently weigh their (e.g., object and background) contributions for visually pleasant reconstruction. We have conducted comprehensive experiments to evaluate the performance on PASCAL VOC dataset at a very low bitrate scenario (e.g., ≲0.1 bits per pixel - bpp) which have demonstrated noticeable subjective quality improvement compared with JPEG2K, HEVC-based BPG and another learned image compression method. All relevant materials are made publicly accessible at https://njuvision.github.io/Neural-Object-Coding/.

READ FULL TEXT

page 2

page 4

page 5

research
07/28/2022

Content-oriented learned image compression

In recent years, with the development of deep neural networks, end-to-en...
research
04/25/2022

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

Questing for lossy image coding (LIC) with superior efficiency on both c...
research
07/01/2022

Learning to segment from object sizes

Deep learning has proved particularly useful for semantic segmentation, ...
research
02/10/2022

Dynamic Background Subtraction by Generative Neural Networks

Background subtraction is a significant task in computer vision and an e...
research
01/06/2022

Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling

Given an aerial image, aerial scene parsing (ASP) targets to interpret t...
research
01/20/2023

Optimized learned entropy coding parameters for practical neural-based image and video compression

Neural-based image and video codecs are significantly more power-efficie...
research
06/20/2017

Clustering-Based Quantisation for PDE-Based Image Compression

Finding optimal data for inpainting is a key problem in the context of p...

Please sign up or login with your details

Forgot password? Click here to reset