Can the Optimizer Cost be Used to Predict Query Execution Times?

05/02/2019
by Anthony Kleerekoper, et al.

Predicting the execution time of queries is an important problem with applications in scheduling, service level agreements and error detection. During query planning, a cost is associated with the chosen execution plan and used to rank competing plans. It would be convenient to use that cost to predict execution time, but it has been claimed in the literature that this is not possible. In this paper, we thoroughly investigate this claim, considering both linear and non-linear models. We find that more complex models using only the optimizer cost achieve accuracy comparable to that reported in the literature. The most accurate method in the literature is nearest-neighbour regression, which does not produce a model. The published results used a large feature set to identify nearest neighbours. We show that it is possible to achieve the same level of accuracy using only the cost to identify nearest neighbours. Using a smaller feature set reduces overhead, both in the storage space needed for the training data and in the time taken to produce a prediction.
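As a rough illustration of nearest-neighbour regression on the optimizer cost alone, the following minimal sketch predicts execution time as the mean time of the k training queries whose cost is closest to that of the new query. It is not the authors' implementation: the function name `knn_predict_time`, the choice of an unweighted mean, the value of k, and the sample data in `history` are all assumptions made for illustration.

```python
def knn_predict_time(train, cost, k=3):
    """Predict execution time from optimizer cost with k-nearest-neighbour regression.

    train: list of (optimizer_cost, execution_time) pairs from past queries.
    cost:  optimizer cost of the query whose execution time is wanted.
    k:     number of neighbours to average (an illustrative default).
    """
    if not train:
        raise ValueError("training set is empty")
    k = min(k, len(train))
    # Distance is measured on the single feature, the optimizer cost.
    nearest = sorted(train, key=lambda pair: abs(pair[0] - cost))[:k]
    # Unweighted mean of the neighbours' observed execution times.
    return sum(time for _, time in nearest) / k


# Hypothetical training data: (optimizer cost, observed execution time in seconds).
history = [(10.0, 0.2), (50.0, 1.1), (55.0, 1.3), (400.0, 9.0), (420.0, 9.6)]

predicted = knn_predict_time(history, cost=52.0, k=2)
print(predicted)
```

Because only one number is stored per training query besides its execution time, both the training data and the neighbour search stay small, which is the overhead advantage the abstract describes.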


