Facilitating SQL Query Composition and Analysis

02/21/2020
by Zainab Zolaktaf, et al.

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to execution. We achieve this by predicting properties of a query such as its answer size, run-time, and error class. Unlike existing approaches, ours does not rely on any statistics from the database instance or on query execution plans. This is particularly important in settings with limited access to the database instance. Our approach uses data-driven machine learning techniques that rely on large query workloads to model SQL queries and their properties. We evaluate the utility of neural network models and traditional machine learning models on two real-world query workloads: the Sloan Digital Sky Survey (SDSS) and the SQLShare query workload. Empirical results show that the neural network models are more accurate in predicting the query error class, achieving a higher F-measure on classes with fewer samples, and that they also perform better on other problems such as run-time and answer size prediction. These results are encouraging and confirm that SQL query workloads and data-driven machine learning methods can be leveraged to facilitate query composition and analysis.
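
To make the core idea concrete, the following is a minimal sketch of predicting a query property from the query text alone, using a workload of previously labeled queries and no database statistics or execution plans. It is not the authors' method: the abstract reports neural network and traditional machine learning models, whereas this sketch swaps in a simple bag-of-tokens representation with a 1-nearest-neighbor lookup. The function names, the toy workload, and the labels `success` and `syntax_error` are all hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(query):
    """Lowercase a SQL string and split it into identifier, number, and symbol tokens."""
    return re.findall(r"[a-z_][a-z_0-9]*|\d+|[^\s\w]", query.lower())


def featurize(query):
    """Bag-of-tokens counts built from the query text only (no database access)."""
    return Counter(tokenize(query))


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def predict(query, workload):
    """Return the label of the most similar workload query (1-nearest neighbor)."""
    feats = featurize(query)
    nearest = max(workload, key=lambda item: cosine(feats, featurize(item[0])))
    return nearest[1]


# Hypothetical toy workload of (query text, error class) pairs.
# Real workloads such as SDSS or SQLShare are far larger; these labels are assumptions.
WORKLOAD = [
    ("SELECT ra, dec FROM PhotoObj WHERE ra > 180", "success"),
    ("SELECT TOP 10 objid FROM SpecObj", "success"),
    ("SELEC * FORM PhotoObj", "syntax_error"),
    ("SELECT * FROM WHERE", "syntax_error"),
]

if __name__ == "__main__":
    print(predict("SELECT ra FROM PhotoObj WHERE dec > 10", WORKLOAD))
    print(predict("SELEC objid FORM SpecObj", WORKLOAD))
```

The same text-only features could in principle feed a regressor for run-time or answer size; the point of the sketch is only that the prediction happens before execution and uses nothing but the query string and past workload.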


