An Adaptive Column Compression Family for Self-Driving Databases

09/06/2022
by   Marcell Fehér, et al.
0

Modern in-memory databases are typically used for high-performance workloads, therefore they have to be optimized for small memory footprint and high query speed at the same time. Data compression has the potential to reduce memory requirements but often reduces query speed too. In this paper we propose a novel, adaptive compressor that offers a new trade-off point of these dimensions, achieving better compression than LZ4 while reaching query speeds close to the fastest existing segment encoders. We evaluate our compressor both with synthetic data in isolation and on the TPC-H and Join Order Benchmarks, integrated into a modern relational column store, Hyrise.

READ FULL TEXT
research
04/20/2020

MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model

In this paper, we present MorphStore, an open-source in-memory columnar ...
research
05/18/2021

LEA: A Learned Encoding Advisor for Column Stores

Data warehouses organize data in a columnar format to enable faster scan...
research
09/18/2018

HDTCat: let's make HDT scale

HDT (Header, Dictionary, Triples) is a serialization for RDF. HDT has be...
research
05/19/2021

Revisiting Data Compression in Column-Stores

Data compression is widely used in contemporary column-oriented DBMSes t...
research
04/27/2019

A computational model for analytic column stores

This work presents an abstract model for the computations performed by a...
research
03/05/2020

Order-Preserving Key Compression for In-Memory Search Trees

We present the High-speed Order-Preserving Encoder (HOPE) for in-memory ...
research
04/17/2023

Hybrid Materialization in a Disk-Based Column-Store

In column-oriented query processing, a materialization strategy determin...

Please sign up or login with your details

Forgot password? Click here to reset