Enabling Efficient Updates in KV Storage via Hashing: Design and Performance Evaluation

11/25/2018
by   Yongkun Li, et al.
0

Persistent key-value (KV) stores mostly build on the Log-Structured Merge (LSM) tree for high write performance, yet the LSM-tree suffers from the inherently high I/O amplification. KV separation mitigates I/O amplification by storing only keys in the LSM-tree and values in separate storage. However, the current KV separation design remains inefficient under update-intensive workloads due to its high garbage collection (GC) overhead in value storage. We propose HashKV, which aims for high update performance atop KV separation under update-intensive workloads. HashKV uses hash-based data grouping, which deterministically maps values to storage space so as to make both updates and GC efficient. We further relax the restriction of such deterministic mappings via simple but useful design extensions. We extensively evaluate various design aspects of HashKV. We show that HashKV achieves 4.6x update throughput and 53.4 addition, we demonstrate that we can integrate the design of HashKV with state-of-the-art KV stores and improve their respective performance.

READ FULL TEXT

page 20

page 21

research
05/28/2020

CedrusDB: Persistent Key-Value Store with Memory-Mapped Lazy-Trie

As RAM is becoming cheaper and growing abundant, it is time to revisit t...
research
07/11/2018

Co-KV: A Collaborative Key-Value Store Using Near-Data Processing to Improve Compaction for the LSM-tree

Log-structured merge tree (LSM-tree) based key-value stores are widely e...
research
05/01/2023

An Update-intensive LSM-based R-tree Index

Many applications require update-intensive workloads on spatial objects,...
research
06/07/2021

Balancing Garbage Collection vs I/O Amplification using hybrid Key-Value Placement in LSM-based Key-Value Stores

Key-value (KV) separation is a technique that introduces randomness in t...
research
07/14/2021

Consistent RDMA-Friendly Hashing on Remote Persistent Memory

Coalescing RDMA and Persistent Memory (PM) delivers high end-to-end perf...
research
04/26/2019

Authenticated Key-Value Stores with Hardware Enclaves

Authenticated data storage on an untrusted platform is an important comp...
research
12/31/2021

Elimination (a,b)-trees with fast, durable updates

Many concurrent dictionary implementations are designed and optimized fo...

Please sign up or login with your details

Forgot password? Click here to reset