PRESTO: Probabilistic Cardinality Estimation for RDF Queries Based on Subgraph Overlapping

01/19/2018
by Xin Wang, et al.

In query optimisation, accurate cardinality estimation is essential for finding optimal query plans. It is especially challenging for RDF because of the lack of an explicit schema and the large number of joins in RDF queries. Existing approaches typically collect statistics based on triple counts and estimate the cardinality of a query as the product of the estimates for its join components, so errors can accumulate even when the estimate for each component is accurate. In contrast, we propose PRESTO, a cardinality estimation method that is based on counts of subgraphs rather than triples and uses a probabilistic method to estimate the cardinality of an RDF query as a whole. PRESTO avoids some major issues of existing approaches and can accurately estimate arbitrary queries under a bounded memory constraint. We evaluate PRESTO on YAGO and show that it is more accurate for both simple and complex queries.
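To make the point about accumulating errors concrete, here is a minimal sketch of a product-style estimate for a two-pattern star join. It is not PRESTO's method; it only illustrates the kind of triple-count baseline the abstract criticises. The toy triples, the function names, and the specific independence formula (product of per-predicate counts divided by the number of distinct subjects) are assumptions chosen for illustration.

```python
from collections import Counter

# Toy RDF graph as (subject, predicate, object) triples; the data is invented.
TRIPLES = [
    ("alice", "worksAt", "acme"),
    ("bob", "worksAt", "acme"),
    ("carol", "livesIn", "paris"),
    ("dave", "livesIn", "paris"),
    ("alice", "livesIn", "rome"),
]


def true_star_cardinality(triples, p1, p2):
    """Exact result count of the star query { ?s p1 ?o1 . ?s p2 ?o2 }."""
    per_subject_p1 = Counter(s for s, p, _ in triples if p == p1)
    per_subject_p2 = Counter(s for s, p, _ in triples if p == p2)
    # Each subject contributes (#p1 triples) * (#p2 triples) result rows.
    return sum(n * per_subject_p2[s] for s, n in per_subject_p1.items())


def independence_estimate(triples, p1, p2):
    """Product-style estimate: |p1| * |p2| / |distinct subjects|.

    Assumes subjects are uniformly distributed and the two predicates
    are independent, which is an illustrative simplification.
    """
    count_p1 = sum(1 for _, p, _ in triples if p == p1)
    count_p2 = sum(1 for _, p, _ in triples if p == p2)
    distinct_subjects = {s for s, _, _ in triples}
    return count_p1 * count_p2 / len(distinct_subjects)


if __name__ == "__main__":
    print("true:", true_star_cardinality(TRIPLES, "worksAt", "livesIn"))
    print("estimate:", independence_estimate(TRIPLES, "worksAt", "livesIn"))
```

In this toy graph the per-predicate counts are exact (2 and 3), yet the combined estimate differs from the true join size, because the formula ignores which subjects actually carry both predicates. Statistics over subgraphs, as the abstract proposes, would record that co-occurrence directly.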

