Neural Distributed Image Compression with Cross-Attention Feature Alignment

07/18/2022
by   Nitish Mital, et al.
19

We propose a novel deep neural network (DNN) architecture for compressing an image when a correlated image is available as side information only at the decoder side, a special case of the well-known and heavily studied distributed source coding (DSC) problem. In particular, we consider a pair of stereo images, which have overlapping fields of view, captured by a synchronized and calibrated pair of cameras; and therefore, are highly correlated. We assume that one image of the pair is to be compressed and transmitted, while the other image is available only at the decoder. In the proposed architecture, the encoder maps the input image to a latent space using a DNN, quantizes the latent representation, and compresses it losslessly using entropy coding. The proposed decoder extracts useful information common between the images solely from the available side information, as well as a latent representation of the side information. Then, the latent representations of the two images, one received from the encoder, the other extracted locally, along with the locally generated common information, are fed to the respective decoders of the two images. We employ a cross-attention module (CAM) to align the feature maps obtained in the intermediate layers of the respective decoders of the two images, thus allowing better utilization of the side information. We train and demonstrate the effectiveness of the proposed algorithm on various realistic setups, such as KITTI and Cityscape datasets of stereo image pairs. Our results show that the proposed architecture is capable of exploiting the decoder-only side information in a more efficient manner as it outperforms previous works. We also show that the proposed method is able to provide significant gains even in the case of uncalibrated and unsynchronized camera array use cases.

READ FULL TEXT

page 7

page 8

page 9

page 10

page 11

page 14

page 15

page 16

research
06/22/2021

Deep Stereo Image Compression with Decoder Side Information using Wyner Common Information

We present a novel deep neural network (DNN) architecture for compressin...
research
01/25/2022

Distributed Image Transmission using Deep Joint Source-Channel Coding

We study the problem of deep joint source-channel coding (D-JSCC) for co...
research
07/18/2023

ECSIC: Epipolar Cross Attention for Stereo Image Compression

In this paper, we present ECSIC, a novel learned method for stereo image...
research
09/20/2023

Neural Image Compression Using Masked Sparse Visual Representation

We study neural image compression based on the Sparse Visual Representat...
research
11/18/2020

Convolutional Autoencoder for Blind Hyperspectral Image Unmixing

In the remote sensing context spectral unmixing is a technique to decomp...
research
08/09/2019

DSIC: Deep Stereo Image Compression

In this paper we tackle the problem of stereo image compression, and lev...
research
05/07/2023

Learned Wyner-Ziv Compressors Recover Binning

We consider lossy compression of an information source when the decoder ...

Please sign up or login with your details

Forgot password? Click here to reset