Bounding the Last Mile: Efficient Learned String Indexing

11/29/2021
by Benjamin Spector, et al.

We introduce the RadixStringSpline (RSS), a learned index structure for efficiently indexing strings. RSS is a tree of radix splines, each indexing a fixed number of bytes. RSS approaches or exceeds the performance of traditional string indexes while using 7-70× less memory. It achieves this by indexing only the minimal string prefix needed to distinguish the data, unlike most learned approaches, which index the entire string. Additionally, the bounded-error nature of RSS accelerates the last-mile search and enables a memory-efficient hash-table lookup accelerator. We benchmark RSS on several real-world string datasets against ART and HOT. Our experiments suggest that this line of research may be promising for future memory-intensive database applications.
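
The bounded-error last-mile search mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the RSS structure itself: as a simplifying assumption, it replaces the tree of radix splines with a single linear model over the first 8 bytes of each key. The names `BoundedErrorIndex`, `prefix_to_int`, and `PREFIX_BYTES` are hypothetical and chosen only for illustration. The point it shows is that once the maximum prediction error is known, a lookup only needs a binary search inside a small window around the predicted position.

```python
import bisect

PREFIX_BYTES = 8  # assumed chunk width for this sketch, not a value from the paper


def prefix_to_int(s: bytes) -> int:
    """Interpret the first PREFIX_BYTES bytes (zero-padded) as a big-endian integer."""
    return int.from_bytes(s[:PREFIX_BYTES].ljust(PREFIX_BYTES, b"\x00"), "big")


class BoundedErrorIndex:
    """Toy learned index: one linear model plus a recorded maximum error."""

    def __init__(self, keys):
        self.keys = sorted(set(keys))
        n = len(self.keys)
        xs = [prefix_to_int(k) for k in self.keys]
        self.x0 = xs[0]
        span = xs[-1] - xs[0]
        # Linear model mapping a prefix value to an approximate position.
        self.slope = (n - 1) / span if span > 0 else 0.0
        # Largest distance between predicted and true position over all keys.
        self.max_err = max(abs(self._predict(x) - i) for i, x in enumerate(xs))

    def _predict(self, x: int) -> int:
        pos = int((x - self.x0) * self.slope)
        # Clamp so that out-of-range queries still yield a valid position.
        return min(max(pos, 0), len(self.keys) - 1)

    def lookup(self, key: bytes) -> int:
        """Return the position of key in the sorted key array, or -1 if absent."""
        n = len(self.keys)
        pred = self._predict(prefix_to_int(key))
        # Last-mile search: binary search restricted to the error window.
        lo = max(0, pred - self.max_err)
        hi = min(n, pred + self.max_err + 1)
        i = bisect.bisect_left(self.keys, key, lo, hi)
        if i < n and self.keys[i] == key:
            return i
        return -1
```

In this sketch the window has at most `2 * max_err + 1` entries, so the cost of the final search depends on the model's error bound rather than on the total number of keys. A key that is not in the index may fall outside the window, in which case the lookup simply reports it as absent.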

research
05/11/2022

LSI: A Learned Secondary Index Structure

Learned index structures have been shown to achieve favorable lookup per...
research
05/25/2018

Strong link between BWT and XBW via Aho-Corasick automaton and applications to Run-Length Encoding

The boom of genomic sequencing makes compression of set of sequences ine...
research
08/06/2023

Nucleotide String Indexing using Range Matching

The two most common data-structures for genome indexing, FM-indices and ...
research
02/05/2021

A Memory-Efficient FM-Index Constructor for Next-Generation Sequencing Applications on FPGAs

FM-index is an efficient data structure for string search and is widely ...
research
01/04/2021

A Pluggable Learned Index Method via Sampling and Gap Insertion

Database indexes facilitate data retrieval and benefit broad application...
research
03/17/2020

An Efficient Implementation of Manacher's Algorithm

Manacher's algorithm has been shown to be optimal to the longest palindr...
research
04/09/2019

Suffix Trees, DAWGs and CDAWGs for Forward and Backward Tries

The suffix tree, DAWG, and CDAWG are fundamental indexing structures of ...
