Region-of-Interest Based Neural Video Compression

03/03/2022
by   Yura Perugachi-Diaz, et al.
0

Humans do not perceive all parts of a scene with the same resolution, but rather focus on few regions of interest (ROIs). Traditional Object-Based codecs take advantage of this biological intuition, and are capable of non-uniform allocation of bits in favor of salient regions, at the expense of increased distortion the remaining areas: such a strategy allows a boost in perceptual quality under low rate constraints. Recently, several neural codecs have been introduced for video compression, yet they operate uniformly over all spatial locations, lacking the capability of ROI-based processing. In this paper, we introduce two models for ROI-based neural video coding. First, we propose an implicit model that is fed with a binary ROI mask and it is trained by de-emphasizing the distortion of the background. Secondly, we design an explicit latent scaling method, that allows control over the quantization binwidth for different spatial regions of latent variables, conditioned on the ROI mask. By extensive experiments, we show that our methods outperform all our baselines in terms of Rate-Distortion (R-D) performance in the ROI. Moreover, they can generalize to different datasets and to any arbitrary ROI at inference time. Finally, they do not require expensive pixel-level annotations during training, as synthetic ROI masks can be used with little to no degradation in performance. To the best of our knowledge, our proposals are the first solutions that integrate ROI-based capabilities into neural video compression models.

READ FULL TEXT

page 6

page 7

page 9

page 12

page 15

page 16

research
03/16/2023

SigVIC: Spatial Importance Guided Variable-Rate Image Compression

Variable-rate mechanism has improved the flexibility and efficiency of l...
research
09/14/2022

Lossy Image Compression with Conditional Diffusion Models

Denoising diffusion models have recently marked a milestone in high-qual...
research
04/21/2021

Visual Analysis Motivated Rate-Distortion Model for Image Coding

Optimized for pixel fidelity metrics, images compressed by existing imag...
research
02/04/2021

Progressive Neural Image Compression with Nested Quantization and Latent Ordering

We present PLONQ, a progressive neural image compression scheme which pu...
research
03/11/2022

Video Coding for Machines with Feature-Based Rate-Distortion Optimization

Common state-of-the-art video codecs are optimized to deliver a low bitr...
research
03/21/2022

Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression

Modeling latent variables with priors and hyperpriors is an essential pr...
research
05/18/2020

Deep Implicit Volume Compression

We describe a novel approach for compressing truncated signed distance f...

Please sign up or login with your details

Forgot password? Click here to reset