Learning Multi-dimensional Indexes

12/03/2019
by Vikram Nathan, et al.

Scanning and filtering over multi-dimensional tables are key operations in modern analytical database engines. To optimize these operations, databases often create clustered indexes over a single dimension, build multi-dimensional indexes such as R-trees, or use complex sort orders (e.g., Z-ordering). However, these schemes are often hard to tune, and their performance is inconsistent across different datasets and queries. In this paper, we introduce Flood, a multi-dimensional in-memory index that automatically adapts itself to a particular dataset and workload by jointly optimizing the index structure and data storage. On real-world datasets and workloads, Flood is up to three orders of magnitude faster for range scans with predicates than state-of-the-art multi-dimensional indexes or sort orders. Our work serves as a building block towards an end-to-end learned database system.
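
For readers unfamiliar with the Z-ordering baseline mentioned in the abstract, the following is a minimal sketch of a two-dimensional Z-order (Morton) sort key. It is not Flood and is not taken from the paper; the function names `interleave_bits` and `z_order_sort`, the 16-bit default width, and the assumption of non-negative integer coordinates are all illustrative choices.

```python
from typing import List, Tuple


def interleave_bits(x: int, y: int, bits: int = 16) -> int:
    """Interleave the low `bits` bits of x and y into one Z-order key.

    Hypothetical helper: assumes x and y are non-negative integers
    that fit in `bits` bits.
    """
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)      # bits of x go to even positions
        key |= ((y >> i) & 1) << (2 * i + 1)  # bits of y go to odd positions
    return key


def z_order_sort(points: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Sort 2-D points by their Z-order key, so points that are close in
    both dimensions tend to land near each other in the sorted layout."""
    return sorted(points, key=lambda p: interleave_bits(p[0], p[1]))
```

A fixed sort order like this treats every dimension the same way regardless of the data or the queries, which is one reason such schemes can perform inconsistently across datasets and workloads.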

research
06/23/2020

Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads

Filtering data based on predicates is one of the most fundamental operat...
research
02/12/2021

Spatial Interpolation-based Learned Index for Range and kNN Queries

A corpus of recent work has revealed that the learned index can improve ...
research
08/24/2020

The Case for Learned Spatial Indexes

Spatial data is ubiquitous. Massive amounts of data are generated every ...
research
03/16/2018

Distributed Caching for Complex Querying of Raw Arrays

As applications continue to generate multi-dimensional data at exponenti...
research
02/05/2018

Mitigating Spreadsheet Risk in Complex Multi-Dimensional Models in Excel

Microsoft Excel is the most ubiquitous analytical tool ever built. Compa...
research
05/02/2019

SUMMARIZED: Efficient Framework for Analyzing Multidimensional Process Traces under Edit-distance Constraint

Domains such as scientific workflows and business processes exhibit data...
research
08/07/2022

Automatically Finding Optimal Index Structure

Existing learned indexes (e.g., RMI, ALEX, PGM) optimize the internal re...
