Estimating the Resize Parameter in End-to-end Learned Image Compression

04/26/2022
by   Li-Heng Chen, et al.
2

We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models. Our approach is simple: compose a pair of differentiable downsampling/upsampling layers that sandwich a neural compression model. To determine resize factors for different inputs, we utilize another neural network jointly trained with the compression model, with the end goal of minimizing the rate-distortion objective. Our results suggest that "compression friendly" downsampled representations can be quickly determined during encoding by using an auxiliary network and differentiable image warping. By conducting extensive experimental tests on existing deep image compression models, we show results that our new resizing parameter estimation framework can provide Bjøntegaard-Delta rate (BD-rate) improvement of about 10 We also carried out a subjective quality study, the results of which show that our new approach yields favorable compressed images. To facilitate reproducible research in this direction, the implementation used in this paper is being made freely available online at: https://github.com/treammm/ResizeCompression.

READ FULL TEXT

page 1

page 3

page 6

page 10

research
05/01/2019

Learned Image Compression with Soft Bit-based Rate-Distortion Optimization

This paper introduces the notion of soft bits to address the rate-distor...
research
05/16/2021

Substitutional Neural Image Compression

We describe Substitutional Neural Image Compression (SNIC), a general ap...
research
09/27/2020

Learning to Improve Image Compression without Changing the Standard Decoder

In recent years we have witnessed an increasing interest in applying Dee...
research
08/21/2021

Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform

We propose a versatile deep image compression network based on Spatial F...
research
12/25/2021

Pseudocylindrical Convolutions for Learned Omnidirectional Image Compression

Although equirectangular projection (ERP) is a convenient form to store ...
research
09/13/2023

Differentiable JPEG: The Devil is in the Details

JPEG remains one of the most widespread lossy image coding methods. Howe...
research
03/03/2023

Rotation Invariant Quantization for Model Compression

Post-training Neural Network (NN) model compression is an attractive app...

Please sign up or login with your details

Forgot password? Click here to reset