Learned Sorted Table Search and Static Indexes in Small Model Space

07/19/2021
by   Domenico Amato, et al.
6

Machine Learning Techniques, properly combined with Data Structures, have resulted in Learned Static Indexes, innovative and powerful tools that speed-up Binary Search, with the use of additional space with respect to the table being searched into. Such space is devoted to the ML model. Although in their infancy, they are methodologically and practically important, due to the pervasiveness of Sorted Table Search procedures. In modern applications, model space is a key factor and, in fact, a major open question concerning this area is to assess to what extent one can enjoy the speed-up of Learned Indexes while using constant or nearly constant space models. We address it here by (a) introducing two new models, i.e., denoted KO-BFS and SY-RMI, respectively; (b) by systematically exploring, for the first time, the time-space trade-offs of a hierarchy of existing models, i.e., the ones in SOSD, together with the new ones. We document a novel and rather complex time-space trade-off picture, which is very informative for users. We experimentally show that the KO-BFS can speed up Interpolation Search and Uniform Binary Search in constant space. For other versions of Binary Search, our second model, together with the bi-criteria PGM index, can achieve a speed-up with a model space of 0.05% more than the one taken by the table, being competitive in terms of time-space trade-off with existing proposals. The SY-RMI and the bi-criteria PGM complement each other quite well across the various levels of the internal memory hierarchy. Finally, our findings are of interest to designers, since they highlight the need for further studies regarding the time-space relation in Learned Indexes.

READ FULL TEXT

page 17

page 18

page 20

page 30

page 31

page 33

page 36

page 39

research
07/20/2020

Learning from Data to Speed-up Sorted Table Search Procedures: Methodology and Practical Guidelines

Sorted Table Search Procedures are the quintessential query-answering to...
research
10/31/2017

Two Error Bounds of Imperfect Binary Search

Suppose we know that an object is in a sorted table and we want to deter...
research
05/15/2018

Time-Space Complexity of Quantum Search Algorithms in Symmetric Cryptanalysis

Performance of cryptanalytic quantum search algorithms is mainly inferre...
research
02/21/2018

Static-Memory-Hard Functions and Nonlinear Space-Time Tradeoffs via Pebbling

Pebble games were originally formulated to study time-space tradeoffs in...
research
04/05/2019

A New Approach to Speed up Combinatorial Search Strategies Using Stack and Hash Table

Owing to the significance of combinatorial search strategies both for ac...

Please sign up or login with your details

Forgot password? Click here to reset