Improved Cardinality Estimation by Learning Queries Containment Rates

08/21/2019
by   Rojeh Hayek, et al.
0

The containment rate of query Q1 in query Q2 over database D is the percentage of Q1's result tuples over D that are also in Q2's result over D. We directly estimate containment rates between pairs of queries over a specific database. For this, we use a specialized deep learning scheme, CRN, which is tailored to representing pairs of SQL queries. Result-cardinality estimation is a core component of query optimization. We describe a novel approach for estimating queries result-cardinalities using estimated containment rates among queries. This containment rate estimation may rely on CRN or embed, unchanged, known cardinality estimation methods. Experimentally, our novel approach for estimating cardinalities, using containment rates between queries, on a challenging real-world database, realizes significant improvements to state of the art cardinality estimation methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2020

NN-based Transformation of Any SQL Cardinality Estimator for Handling DISTINCT, AND, OR and NOT

SQL queries, with the AND, OR, and NOT operators, constitute a broad cla...
research
04/17/2019

Estimating Cardinalities with Deep Sketches

We introduce Deep Sketches, which are compact models of databases that a...
research
09/03/2018

Learned Cardinalities: Estimating Correlated Joins with Deep Learning

We describe a new deep learning approach to cardinality estimation. MSCN...
research
01/29/2018

Estimating the Cardinality of Conjunctive Queries over RDF Data Using Graph Summarisation

Estimating the cardinality (i.e., the number of answers) of conjunctive ...
research
02/04/2020

Using Positional Sequence Patterns to Estimate the Selectivity of SQL LIKE Queries

With the dramatic increase in the amount of the text-based data which co...
research
12/29/2020

BayesCard: A Unified Bayesian Framework for Cardinality Estimation

Cardinality estimation is one of the fundamental problems in database ma...
research
05/19/2020

Machine Learning-based Cardinality Estimation in DBMS on Pre-Aggregated Data

Cardinality estimation is a fundamental task in database query processin...

Please sign up or login with your details

Forgot password? Click here to reset