ConvNeXt-ChARM: ConvNeXt-based Transform for Efficient Neural Image Compression

07/12/2023
by   Ahmed Ghorbel, et al.
0

Over the last few years, neural image compression has gained wide attention from research and industry, yielding promising end-to-end deep neural codecs outperforming their conventional counterparts in rate-distortion performance. Despite significant advancement, current methods, including attention-based transform coding, still need to be improved in reducing the coding rate while preserving the reconstruction fidelity, especially in non-homogeneous textured image areas. Those models also require more parameters and a higher decoding time. To tackle the above challenges, we propose ConvNeXt-ChARM, an efficient ConvNeXt-based transform coding framework, paired with a compute-efficient channel-wise auto-regressive prior to capturing both global and local contexts from the hyper and quantized latent representations. The proposed architecture can be optimized end-to-end to fully exploit the context information and extract compact latent representation while reconstructing higher-quality images. Experimental results on four widely-used datasets showed that ConvNeXt-ChARM brings consistent and significant BD-rate (PSNR) reductions estimated on average to 5.24 reference encoder (VTM-18.0) and the state-of-the-art learned image compression method SwinT-ChARM, respectively. Moreover, we provide model scaling studies to verify the computational efficiency of our approach and conduct several objective and subjective analyses to bring to the fore the performance gap between the next generation ConvNet, namely ConvNeXt, and Swin Transformer.

READ FULL TEXT
research
07/05/2023

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Recently, the performance of neural image compression (NIC) has steadily...
research
07/12/2023

AICT: An Adaptive Image Compression Transformer

Motivated by the efficiency investigation of the Tranformer-based transf...
research
03/09/2022

Neural Data-Dependent Transform for Learned Image Compression

Learned image compression has achieved great success due to its excellen...
research
03/04/2021

A Cross Channel Context Model for Latents in Deep Image Compression

This paper presents a cross channel context model for latents in deep im...
research
04/25/2022

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

Questing for lossy image coding (LIC) with superior efficiency on both c...
research
01/23/2023

Modality-Agnostic Variational Compression of Implicit Neural Representations

We introduce a modality-agnostic neural data compression algorithm based...
research
12/25/2021

Pseudocylindrical Convolutions for Learned Omnidirectional Image Compression

Although equirectangular projection (ERP) is a convenient form to store ...

Please sign up or login with your details

Forgot password? Click here to reset