LFZip: Lossy compression of multivariate floating-point time series data via improved prediction

11/01/2019
by   Shubham Chandak, et al.
0

Time series data compression is emerging as an important problem with the growth in IoT devices and sensors. Due to the presence of noise in these datasets, lossy compression can often provide significant compression gains without impacting the performance of downstream applications. In this work, we propose an error-bounded lossy compressor, LFZip, for multivariate floating-point time series data that provides guaranteed reconstruction up to user-specified maximum absolute error. The compressor is based on the prediction-quantization-entropy coder framework and benefits from improved prediction using linear models and neural networks. We evaluate the compressor on several time series datasets where it outperforms the existing state-of-the-art error-bounded lossy compressors. The code and data are available at https://github.com/shubhamchandak94/LFZip

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2023

Change a Bit to save Bytes: Compression for Floating Point Time-Series Data

The number of IoT devices is expected to continue its dramatic growth in...
research
09/28/2022

Near Lossless Time Series Data Compression Methods using Statistics and Deviation

The last two decades have seen tremendous growth in data collections bec...
research
01/21/2021

Time series compression: a survey

The presence of smart objects is increasingly widespread and their ecosy...
research
06/28/2023

Erasing-based lossless compression method for streaming floating-point time series

There are a prohibitively large number of floating-point time series dat...
research
11/05/2020

Datasets for Benchmarking Floating-Point Compressors

Compression of floating-point data, both lossy and lossless, is a topic ...
research
08/23/2023

Adaptive Encoding Strategies for Erasing-Based Lossless Floating-Point Compression

Lossless floating-point time series compression is crucial for a wide ra...
research
12/02/2022

Ripple: Concept-Based Interpretation for Raw Time Series Models in Education

Time series is the most prevalent form of input data for educational pre...

Please sign up or login with your details

Forgot password? Click here to reset