Updatable Learned Indexes Meet Disk-Resident DBMS – From Evaluations to Design Choices

05/02/2023
by   Hai Lan, et al.
0

Although many updatable learned indexes have been proposed in recent years, whether they can outperform traditional approaches on disk remains unknown. In this study, we revisit and implement four state-of-the-art updatable learned indexes on disk, and compare them against the B+-tree under a wide range of settings. Through our evaluation, we make some key observations: 1) Overall, the B+-tree performs well across a range of workload types and datasets. 2) A learned index could outperform B+-tree or other learned indexes on disk for a specific workload. For example, PGM achieves the best performance in write-only workloads while LIPP significantly outperforms others in lookup-only workloads. We further conduct a detailed performance analysis to reveal the strengths and weaknesses of these learned indexes on disk. Moreover, we summarize the observed common shortcomings in five categories and propose four design principles to guide future design of on-disk, updatable learned indexes: (1) reducing the index's tree height, (2) better data structures to lower operation overheads, (3) improving the efficiency of scan operations, and (4) more efficient storage layout.

READ FULL TEXT

page 10

page 12

page 15

research
06/05/2023

A Simple Yet High-Performing On-disk Learned Index: Can We Have Our Cake and Eat it Too?

While in-memory learned indexes have shown promising performance as comp...
research
04/12/2021

Updatable Learned Index with Precise Positions

Index plays an essential role in modern database engines to accelerate t...
research
07/06/2022

Are Updatable Learned Indexes Ready?

Recently, numerous promising results have shown that updatable learned i...
research
08/14/2023

Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads

LSM-trees are widely adopted as the storage backend of key-value stores....
research
09/17/2021

Micro-architectural Analysis of a Learned Index

Since the publication of The Case for Learned Index Structures in 2018, ...
research
05/21/2019

ALEX: An Updatable Adaptive Learned Index

Recent work on "learned indexes" has revolutionized the way we look at t...
research
07/03/2019

An Experimental Evaluation of Large Scale GBDT Systems

Gradient boosting decision tree (GBDT) is a widely-used machine learning...

Please sign up or login with your details

Forgot password? Click here to reset