PrismDB: Read-aware Log-structured Merge Trees for Heterogeneous Storage

08/05/2020
by   Ashwini Raina, et al.

In recent years, emerging hardware storage technologies have focused on divergent goals: better performance or lower cost-per-bit of storage. Correspondingly, data systems that employ these new technologies are optimized either to be fast (but expensive) or cheap (but slow). We take a different approach: by combining multiple tiers of fast and low-cost storage technologies within the same system, we can achieve a Pareto-efficient balance between performance and cost-per-bit. This paper presents the design and implementation of PrismDB, a novel key-value store based on log-structured merge trees that exploits a full spectrum of heterogeneous storage technologies (from 3D XPoint to QLC NAND). We introduce the notion of "read-awareness" to log-structured merge trees, which allows hot objects to be pinned to faster storage, achieving better tiering and hot-cold separation of objects. Compared to the standard use of RocksDB on flash in datacenters today, PrismDB's average throughput on heterogeneous storage is 2.3× higher and its tail latency is more than an order of magnitude better, using hardware that is half the cost.

research
05/24/2022

Efficient LSM-Tree Key-Value Data Management on Hybrid SSD/HDD Zoned Storage

Zoned storage devices, such as zoned namespace (ZNS) solid-state drives ...
research
05/28/2020

Learning How To Learn Within An LSM-based Key-Value Store

We introduce BOURBON, a log-structured merge (LSM) tree that utilizes ma...
research
05/08/2023

Autumn: A Scalable Read Optimized LSM-tree based Key-Value Stores with Fast Point and Range Read Speed

The Log Structured Merge Trees (LSM-tree) based key-value stores are wid...
research
06/08/2020

Lethe: A Tunable Delete-Aware LSM Engine (Updated Version)

Data-intensive applications fueled the evolution of log structured merge...
research
09/10/2018

The Skiplist-Based LSM Tree

Log-Structured Merge (LSM) Trees provide a tiered data storage and retri...
research
10/17/2019

An LSM-based Tuple Compaction Framework for Apache AsterixDB

Document database systems store self-describing records, such as JSON, "...
research
12/21/2021

Maxwell: a hardware and software highly integrated compute-storage system

The compute-storage framework is responsible for data storage and proces...
