Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads

08/14/2023
by   Dingheng Mo, et al.

LSM-trees are widely adopted as the storage backend of key-value stores. However, optimizing system performance under dynamic workloads has not been sufficiently studied or evaluated in previous work. To fill this gap, we present RusKey, a key-value store with the following new features: (1) RusKey is a first attempt to orchestrate LSM-tree structures online to enable robust performance under dynamic workloads; (2) RusKey is the first study to use Reinforcement Learning (RL) to guide LSM-tree transformations; (3) RusKey includes a new LSM-tree design, named FLSM-tree, that enables an efficient transition between different compaction policies, which is the bottleneck of dynamic key-value stores, and we justify the superiority of the new design with theoretical analysis; (4) RusKey requires no prior workload knowledge for system adjustment, in contrast to state-of-the-art techniques. Experiments show that RusKey exhibits strong performance robustness in diverse workloads, achieving up to 4x better end-to-end performance than the RocksDB system under various settings.

research
07/11/2018

Co-KV: A Collaborative Key-Value Store Using Near-Data Processing to Improve Compaction for the LSM-tree

Log-structured merge tree (LSM-tree) based key-value stores are widely e...
research
04/03/2021

Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store

The cloud infrastructure motivates disaggregation of monolithic data sto...
research
05/02/2023

Updatable Learned Indexes Meet Disk-Resident DBMS – From Evaluations to Design Choices

Although many updatable learned indexes have been proposed in recent yea...
research
09/19/2022

DINOMO: An Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory (Extended Version)

We present Dinomo, a novel key-value store for disaggregated persistent ...
research
04/26/2019

Authenticated Key-Value Stores with Hardware Enclaves

Authenticated data storage on an untrusted platform is an important comp...
research
07/03/2019

An Experimental Evaluation of Large Scale GBDT Systems

Gradient boosting decision tree (GBDT) is a widely-used machine learning...
research
11/17/2017

RLWS: A Reinforcement Learning based GPU Warp Scheduler

The Streaming Multiprocessors (SMs) of a Graphics Processing Unit (GPU) ...
