SEACOW: Synopsis Embedded Array Compression using Wavelet Transform

09/16/2021
by Minsoo Kim, et al.

Multidimensional data is now produced in many domains. Because large volumes of this data are often used in complex analytical tasks, it must be stored compactly and support fast query responses. Existing compression schemes reduce storage effectively; however, they can increase the overall computational cost of performing queries. Querying compressed data efficiently requires a compression scheme carefully designed for the task. This study presents a novel compression scheme, SEACOW, for storing and querying multidimensional array data. The scheme is based on the wavelet transform and exploits the hierarchical relationship between sub-arrays in the transformed data to compress the array. The compressed result embeds a synopsis that acts as an index and improves query processing performance. For the experiments, we implemented an array database, SEACOW storage, and evaluated query processing performance on real data sets. Our experiments show that 1) SEACOW provides a high compression ratio comparable to existing compression schemes and 2) the synopsis improves analytical query processing performance.
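
To make the idea of a synopsis embedded in wavelet-transformed data concrete, here is a minimal sketch. It is not the SEACOW implementation: it assumes a one-dimensional integer array and a lossless integer Haar transform (the S-transform), whereas the abstract describes multidimensional arrays and does not name the wavelet used. All function names (`haar_step`, `decompose`, `reconstruct`) are hypothetical and chosen for illustration only.

```python
def haar_step(values):
    """One level of the integer Haar (S-) transform on an even-length list.

    Assumption for illustration: 1-D input; the paper targets
    multidimensional arrays.
    """
    approx, detail = [], []
    for a, b in zip(values[0::2], values[1::2]):
        d = a - b              # detail coefficient
        s = b + (d >> 1)       # equals floor((a + b) / 2)
        approx.append(s)
        detail.append(d)
    return approx, detail


def haar_step_inverse(approx, detail):
    """Exactly invert haar_step."""
    out = []
    for s, d in zip(approx, detail):
        b = s - (d >> 1)
        a = b + d
        out.extend((a, b))
    return out


def decompose(values, levels):
    """Apply haar_step repeatedly to the approximation band."""
    approx, details = list(values), []
    for _ in range(levels):
        approx, d = haar_step(approx)
        details.append(d)
    return approx, details


def reconstruct(approx, details):
    """Rebuild the original list from the coarsest band and all details."""
    for d in reversed(details):
        approx = haar_step_inverse(approx, d)
    return approx


data = [10, 12, 11, 13, 50, 52, 51, 49]
synopsis, details = decompose(data, levels=2)
print(synopsis)   # coarse band: one value per block of four cells
print(details)    # small-magnitude coefficients, finest level first
```

In this sketch the coarsest band holds one value per block of four cells, close to each block's mean, so a query for an approximate block average could read it without decoding the details. The detail coefficients stay small when neighboring cells are similar, which is what makes them cheap to encode, and `reconstruct(synopsis, details)` returns the original list exactly.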

research
04/20/2020

MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model

In this paper, we present MorphStore, an open-source in-memory columnar ...
research
09/25/2017

Camera-Aware Multi-Resolution Analysis (CAMRA) for Raw Sensor Data Compression

We propose a novel lossless and lossy compression scheme for color filte...
research
02/20/2023

Reducing the memory usage of Lattice-Boltzmann schemes with a DWT-based compression

This paper presents a new solution to address the challenge of increasin...
research
09/18/2021

iWave3D: End-to-end Brain Image Compression with Trainable 3-D Wavelet Transform

With the rapid development of whole brain imaging technology, a large nu...
research
07/04/2017

Ingestion, Indexing and Retrieval of High-Velocity Multidimensional Sensor Data on a Single Node

Multidimensional data are becoming more prevalent, partly due to the ris...
research
01/22/2020

Computing Similarity Queries for Correlated Gaussian Sources

Among many current data processing systems, the objectives are often not...
research
03/18/2023

Lossless Microarray Image Compression by Hardware Array Compactor

Microarray technology is a new and powerful tool for the concurrent moni...
