BitE : Accelerating Learned Query Optimization in a Mixed-Workload Environment

06/01/2023
by   Yuri Kim, et al.
0

Although the many efforts to apply deep reinforcement learning to query optimization in recent years, there remains room for improvement as query optimizers are complex entities that require hand-designed tuning of workloads and datasets. Recent research present learned query optimizations results mostly in bulks of single workloads which focus on picking up the unique traits of the specific workload. This proves to be problematic in scenarios where the different characteristics of multiple workloads and datasets are to be mixed and learned together. Henceforth, in this paper, we propose BitE, a novel ensemble learning model using database statistics and metadata to tune a learned query optimizer for enhancing performance. On the way, we introduce multiple revisions to solve several challenges: we extend the search space for the optimal Abstract SQL Plan(represented as a JSON object called ASP) by expanding hintsets, we steer the model away from the default plans that may be biased by configuring the experience with all unique plans of queries, and we deviate from the traditional loss functions and choose an alternative method to cope with underestimation and overestimation of reward. Our model achieves 19.6 existing traditional methods whilst using a comparable level of resources.

READ FULL TEXT

page 9

page 10

research
04/07/2019

Neo: A Learned Query Optimizer

Query optimization is one of the most challenging problems in database s...
research
01/07/2019

Guided Automated Learning for query workload re-Optimization

Query optimization is a hallmark of database systems enabling complex SQ...
research
10/24/2022

Deploying a Steered Query Optimizer in Production at Microsoft

Modern analytical workloads are highly heterogeneous and massively compl...
research
03/27/2023

Learned Query Superoptimization

Traditional query optimizers are designed to be fast and stateless: each...
research
05/26/2021

Database Workload Characterization with Query Plan Encoders

Smart databases are adopting artificial intelligence (AI) technologies t...
research
01/17/2018

Query2Vec: An Evaluation of NLP Techniques for Generalized Workload Analytics

We consider methods for learning vector representations of SQL queries t...
research
01/17/2018

Query2Vec: NLP Meets Databases for Generalized Workload Analytics

We propose methods for learning vector representations of SQL workloads ...

Please sign up or login with your details

Forgot password? Click here to reset