iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression

by   Shifeng Zhang, et al.

It is nontrivial to store rapidly growing big data nowadays, which demands high-performance lossless compression techniques. Likelihood-based generative models have witnessed their success on lossless compression, where flow based models are desirable in allowing exact data likelihood optimisation with bijective mappings. However, common continuous flows are in contradiction with the discreteness of coding schemes, which requires either 1) imposing strict constraints on flow models that degrades the performance or 2) coding numerous bijective mapping errors which reduces the efficiency. In this paper, we investigate volume preserving flows for lossless compression and show that a bijective mapping without error is possible. We propose Numerical Invertible Volume Preserving Flow (iVPF) which is derived from the general volume preserving flows. By introducing novel computation algorithms on flow models, an exact bijective mapping is achieved without any numerical error. We also propose a lossless compression algorithm based on iVPF. Experiments on various datasets show that the algorithm based on iVPF achieves state-of-the-art compression ratio over lightweight compression algorithms.



page 1

page 2

page 3

page 4


Integer Discrete Flows and Lossless Compression

Lossless compression methods shorten the expected representation size of...

Compression with Flows via Local Bits-Back Coding

Likelihood-based generative models are the backbones of lossless compres...

iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder

It was estimated that the world produced 59 ZB (5.9 × 10^13 GB) of data ...

Jacobian Determinant of Normalizing Flows

Normalizing flows learn a diffeomorphic mapping between the target and b...

Improving Variational Auto-Encoders using convex combination linear Inverse Autoregressive Flow

In this paper, we propose a new volume-preserving flow and show that it ...

A Physics-Informed Vector Quantized Autoencoder for Data Compression of Turbulent Flow

Analyzing large-scale data from simulations of turbulent flows is memory...

Self Normalizing Flows

Efficient gradient computation of the Jacobian determinant term is a cor...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.