NFL: Robust Learned Index via Distribution Transformation

05/24/2022
by   Shangyu Wu, et al.
0

Recent works on learned index open a new direction for the indexing field. The key insight of the learned index is to approximate the mapping between keys and positions with piece-wise linear functions. Such methods require partitioning key space for a better approximation. Although lots of heuristics are proposed to improve the approximation quality, the bottleneck is that the segmentation overheads could hinder the overall performance. This paper tackles the approximation problem by applying a distribution transformation to the keys before constructing the learned index. A two-stage Normalizing-Flow-based Learned index framework (NFL) is proposed, which first transforms the original complex key distribution into a near-uniform distribution, then builds a learned index leveraging the transformed keys. For effective distribution transformation, we propose a Numerical Normalizing Flow (Numerical NF). Based on the characteristics of the transformed keys, we propose a robust After-Flow Learned Index (AFLI). To validate the performance, comprehensive evaluations are conducted on both synthetic and real-world workloads, which shows that the proposed NFL produces the highest throughput and the lowest tail latency compared to the state-of-the-art learned indexes.

READ FULL TEXT

page 9

page 10

page 11

research
07/23/2022

Testing the Robustness of Learned Index Structures

While early empirical evidence has supported the case for learned index ...
research
08/29/2023

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

The growth in data storage capacity and the increasing demands for high ...
research
09/24/2020

Compressed Key Sort and Fast Index Reconstruction

In this paper we propose an index key compression scheme based on the no...
research
05/21/2019

ALEX: An Updatable Adaptive Learned Index

Recent work on "learned indexes" has revolutionized the way we look at t...
research
10/14/2019

The PGM-index: a multicriteria, compressed and learned approach to data indexing

The recent introduction of learned indexes has shaken the foundations of...
research
03/01/2019

Superseding traditional indexes by orchestrating learning and geometry

We design the first learned index that solves the dictionary problem wit...
research
08/11/2021

Towards Practical Learned Indexing

Latest research proposes to replace existing index structures with learn...

Please sign up or login with your details

Forgot password? Click here to reset