The Case for Learned Index Structures

12/04/2017
by   Tim Kraska, et al.
0

Indexes are models: a B-Tree-Index can be seen as a model to map a key to the position of a record within a sorted array, a Hash-Index as a model to map a key to a position of a record within an unsorted array, and a BitMap-Index as a model to indicate if a data record exists or not. In this exploratory research paper, we start from this premise and posit that all existing index structures can be replaced with other types of models, including deep-learning models, which we term learned indexes. The key idea is that a model can learn the sort order or structure of lookup keys and use this signal to effectively predict the position or existence of records. We theoretically analyze under which conditions learned indexes outperform traditional index structures and describe the main challenges in designing learned index structures. Our initial results show, that by using neural nets we are able to outperform cache-optimized B-Trees by up to 70 several real-world data sets. More importantly though, we believe that the idea of replacing core components of a data management system through learned models has far reaching implications for future systems designs and that this work just provides a glimpse of what might be possible.

READ FULL TEXT

page 11

page 13

page 14

research
11/29/2019

SOSD: A Benchmark for Learned Indexes

A groundswell of recent work has focused on improving data management sy...
research
03/20/2019

Indexes in Microsoft SQL Server

Indexes are the best apposite choice for quickly retrieving the records....
research
11/16/2018

The Potential of Learned Index Structures for Index Compression

Inverted indexes are vital in providing fast key-word-based search. For ...
research
06/23/2020

Benchmarking Learned Indexes

Recent advancements in learned index structures propose replacing existi...
research
08/01/2020

The Price of Tailoring the Index to Your Data: Poisoning Attacks on Learned Index Structures

The concept of learned index structures relies on the idea that the inpu...
research
01/22/2019

Just-in-Time Index Compilation

Creating or modifying a primary index is a time-consuming process, as th...
research
11/28/2017

Julian Ernst Besag, 26 March 1945 -- 6 August 2010, a biographical memoir

Julian Besag's contributions to the discipline of statistics are profoun...

Please sign up or login with your details

Forgot password? Click here to reset