Optimal Joins using Compact Data Structures

08/05/2019
by Gonzalo Navarro, et al.

Worst-case optimal join algorithms have gained a lot of attention in the database literature. We now have several different algorithms that have all been shown to be optimal in the worst case, and many of them have also been implemented and tested in practice. However, the implementation of these algorithms often requires an enhanced indexing structure: to achieve optimality we either need to build completely new indexes or we must populate the database with several different instantiations of common indexes such as B+-trees. Either way, this means spending extra storage space, which may be non-negligible. In this paper we show that optimal algorithms can be obtained directly from a representation that regards the relations as point sets in variable-dimensional grids, without the need for extra storage. Our representation is a compact quadtree for the static indexes and a dynamic quadtree sharing subtrees (which we dub a Qdag) for intermediate results. We develop a compositional algorithm to process full join queries when data is stored in these structures, and then show that the running time of this algorithm is worst-case optimal in data complexity. Remarkably, we can even extend our framework to compute more expressive queries in relational algebra using both unions and a form of limited negation, by introducing a lazy version of Qdags. Once again, we can show that the running time of our algorithms is worst-case optimal.
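To make the grid view concrete, here is a minimal Python sketch of the idea of treating binary relations as point sets in a grid and joining them by descending two quadtrees in lockstep. It is an illustration under assumptions, not the paper's method: the function names `build` and `join`, the pointer-based tuple encoding, and the example relations are all hypothetical, and the sketch is neither compact nor shown to be worst-case optimal. It evaluates R(A,B) ⋈ S(B,C) by visiting the eight octants of the (A,B,C) cube and pruning any octant whose projection is empty in either quadtree.

```python
def build(points, side):
    """Build a quadtree over a side x side grid (side is a power of two).

    Returns None for an empty region, True for a single occupied cell,
    otherwise a 4-tuple of children indexed by 2*high_x + high_y.
    (Hypothetical pointer-based encoding, chosen for readability only.)
    """
    if not points:
        return None
    if side == 1:
        return True
    half = side // 2
    quads = [[], [], [], []]
    for x, y in points:
        # Route each point to its quadrant and keep its offset inside it.
        quads[2 * (x >= half) + (y >= half)].append((x % half, y % half))
    return tuple(build(q, half) for q in quads)


def join(r, s, side, a0=0, b0=0, c0=0):
    """Yield every (a, b, c) with (a, b) in r and (b, c) in s.

    r and s are quadtrees built by `build` over grids of the same side.
    Each octant (a, b, c) of the cube projects onto child (a, b) of r
    and child (b, c) of s; an empty projection prunes the whole octant.
    """
    if r is None or s is None:
        return
    if side == 1:
        yield (a0, b0, c0)
        return
    half = side // 2
    for a in (0, 1):
        for b in (0, 1):
            for c in (0, 1):
                yield from join(r[2 * a + b], s[2 * b + c], half,
                                a0 + a * half, b0 + b * half, c0 + c * half)


# Usage with made-up relations over the domain {0, 1, 2, 3}.
R = [(0, 1), (2, 3), (1, 1)]
S = [(1, 2), (3, 0), (1, 0)]
result = sorted(join(build(R, 4), build(S, 4), 4))
print(result)
```

The sketch stores explicit child pointers, which is exactly the overhead a compact quadtree avoids; it is meant only to show how a join reduces to a synchronized traversal of the relations' grids.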


