Ranked Enumeration of Join Queries with Projections

by   Shaleen Deep, et al.

Join query evaluation with ordering is a fundamental data processing task in relational database management systems. SQL and custom graph query languages such as Cypher offer this functionality by allowing users to specify the order via the ORDER BY clause. In many scenarios, the users also want to see the first k results quickly (expressed by the LIMIT clause), but the value of k is not predetermined as user queries are arriving in an online fashion. Recent work has made considerable progress in identifying optimal algorithms for ranked enumeration of join queries that do not contain any projections. In this paper, we initiate the study of the problem of enumerating results in ranked order for queries with projections. Our main result shows that for any acyclic query, it is possible to obtain a near-linear (in the size of the database) delay algorithm after only a linear time preprocessing step for two important ranking functions: sum and lexicographic ordering. For a practical subset of acyclic queries known as star queries, we show an even stronger result that allows a user to obtain a smooth tradeoff between faster answering time guarantees using more preprocessing time. Our results are also extensible to queries containing cycles and unions. We also perform a comprehensive experimental evaluation to demonstrate that our algorithms, which are simple to implement, improve up to three orders of magnitude in the running time over state-of-the-art algorithms implemented within open-source RDBMS and specialized graph databases.



There are no comments yet.


page 1

page 2

page 3

page 4


Enumeration Algorithms for Conjunctive Queries with Projection

We investigate the enumeration of query results for an important subset ...

Fast Join Project Query Evaluation using Matrix Multiplication

In the last few years, much effort has been devoted to developing join a...

Ranked enumeration of MSO logic on words

In the last years, enumeration algorithms with bounded delay have attrac...

Beyond Equi-joins: Ranking, Enumeration and Factorization

We study full acyclic join queries with general join predicates that inv...

Optimal Join Algorithms Meet Top-k

Top-k queries have been studied intensively in the database community an...

Progressive Evaluation of Queries over Tagged Data

Modern information systems often collect raw data in the form of text, i...

Any-k Algorithms for Enumerating Ranked Answers to Conjunctive Queries

We study ranked enumeration for Conjunctive Queries (CQs) where the answ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.