Testing the Robustness of Learned Index Structures

07/23/2022
by Matthias Bachfischer, et al.

While early empirical evidence has supported the case for learned index structures as having favourable average-case performance, little is known about their worst-case performance. By contrast, classical structures are known to achieve optimal worst-case behaviour. This work evaluates the robustness of learned index structures in the presence of adversarial workloads. To simulate adversarial workloads, we carry out a data poisoning attack on linear regression models that manipulates the cumulative distribution function (CDF) on which the learned index model is trained. The attack injects a set of poisoning keys into the training dataset, which degrades the fit of the underlying ML model, increases its prediction error, and thus reduces the overall performance of the learned index structure. We assess the performance of various regression methods and the learned index implementations ALEX and PGM-Index. We show that learned index structures can suffer from a significant performance deterioration of up to 20% on poisoned vs. non-poisoned datasets.
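The poisoning idea described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes integer keys, a single linear regression from key to rank as a stand-in for the learned index model, and a brute-force greedy search over unused keys. The helper names `fit_cdf_regression` and `greedy_poison` are hypothetical and chosen only for this illustration.

```python
import numpy as np


def fit_cdf_regression(keys):
    """Fit rank ~ a * key + b by least squares and return (a, b, mse).

    The rank of a key in the sorted key set plays the role of the
    (unnormalised) CDF that a learned index model is trained on.
    """
    keys = np.sort(np.asarray(keys, dtype=float))
    ranks = np.arange(len(keys), dtype=float)
    a, b = np.polyfit(keys, ranks, 1)
    mse = float(np.mean((a * keys + b - ranks) ** 2))
    return a, b, mse


def greedy_poison(keys, n_poison):
    """Insert up to n_poison unused integer keys, one at a time.

    Assumption for this sketch: each round tries every unused key
    strictly between the smallest and largest key and keeps the one
    that maximises the regression MSE after refitting.
    """
    keys = sorted(set(int(k) for k in keys))
    for _ in range(n_poison):
        present = set(keys)
        candidates = [k for k in range(keys[0] + 1, keys[-1]) if k not in present]
        if not candidates:
            break
        best = max(candidates, key=lambda k: fit_cdf_regression(keys + [k])[2])
        keys.append(best)
        keys.sort()
    return keys


# Evenly spaced keys: the rank is an exactly linear function of the key.
clean_keys = list(range(0, 200, 4))
poisoned_keys = greedy_poison(clean_keys, n_poison=5)

clean_mse = fit_cdf_regression(clean_keys)[2]
poisoned_mse = fit_cdf_regression(poisoned_keys)[2]
print(f"clean MSE: {clean_mse:.4f}, poisoned MSE: {poisoned_mse:.4f}")
```

On evenly spaced keys the clean fit is essentially exact, so any inserted key raises the prediction error. A larger error widens the range an index must search around its predicted position, which is the mechanism behind the slowdown the abstract reports.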


research
08/01/2020

The Price of Tailoring the Index to Your Data: Poisoning Attacks on Learned Index Structures

The concept of learned index structures relies on the idea that the inpu...
research
05/24/2022

NFL: Robust Learned Index via Distribution Transformation

Recent works on learned index open a new direction for the indexing fiel...
research
08/29/2023

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

The growth in data storage capacity and the increasing demands for high ...
research
01/10/2023

Matching calipers and the precision of index estimation

This paper characterizes the precision of index estimation as it carries...
research
11/29/2019

SOSD: A Benchmark for Learned Indexes

A groundswell of recent work has focused on improving data management sy...
research
05/08/2019

A Scalable Learned Index Scheme in Storage Systems

Index structures are important for efficient data access, which have bee...
research
03/04/2020

Analysis of Indexing Structures for Immutable Data

In emerging applications such as blockchains and collaborative data anal...
