Worst-Case Optimal Join Algorithms: Techniques, Results, and Open Problems

03/27/2018
by   Hung Q. Ngo, et al.
0

Worst-case optimal join algorithms are the class of join algorithms whose runtime match the worst-case output size of a given join query. While the first provably worse-case optimal join algorithm was discovered relatively recently, the techniques and results surrounding these algorithms grow out of decades of research from a wide range of areas, intimately connecting graph theory, algorithms, information theory, constraint satisfaction, database theory, and geometric inequalities. These ideas are not just paperware: in addition to academic project implementations, two variations of such algorithms are the work-horse join algorithms of commercial database and data analytics engines. This paper aims to be a brief introduction to the design and analysis of worst-case optimal join algorithms. We discuss the key techniques for proving runtime and output size bounds. We particularly focus on the fascinating connection between join algorithms and information theoretic inequalities, and the idea of how one can turn a proof into an algorithm. Finally, we conclude with a representative list of fundamental open problems in this area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/24/2023

Applications of Information Inequalities to Database Theory Problems

The paper describes several applications of information inequalities to ...
research
12/29/2019

Worst-Case Optimal Radix Triejoin

Relatively recently, the field of join processing has been swayed by the...
research
04/05/2018

Hypertree Decompositions Revisited for PGMs

We revisit the classical problem of exact inference on probabilistic gra...
research
05/31/2013

Theoretical formulation and analysis of the deterministic dendritic cell algorithm

As one of the emerging algorithms in the field of Artificial Immune Syst...
research
06/24/2023

Join Size Bounds using Lp-Norms on Degree Sequences

Estimating the output size of a join query is a fundamental yet longstan...
research
09/26/2019

Box Covers and Domain Orderings for Beyond Worst-Case Join Processing

Recent beyond worst-case optimal join algorithms Minesweeper and its gen...
research
11/10/2017

Size bounds and query plans for relational joins

Relational joins are at the core of relational algebra, which in turn is...

Please sign up or login with your details

Forgot password? Click here to reset