Insights from Generative Modeling for Neural Video Compression

07/28/2021
by Ruihan Yang, et al.

While recent machine learning research has revealed connections between deep generative models such as VAEs and rate-distortion losses used in learned compression, most of this work has focused on images. In a similar spirit, we view recently proposed neural video coding algorithms through the lens of deep autoregressive and latent variable modeling. We present recent neural video codecs as instances of a generalized stochastic temporal autoregressive transform, and propose new avenues for further improvements inspired by normalizing flows and structured priors. We propose several architectures that yield state-of-the-art video compression performance on full-resolution video and discuss their tradeoffs and ablations. In particular, we propose (i) improved temporal autoregressive transforms, (ii) improved entropy models with structured and temporal dependencies, and (iii) variable bitrate versions of our algorithms. Since our improvements are compatible with a large class of existing models, we provide further evidence that the generative modeling viewpoint can advance the neural video coding field.
