ML-Powered Index Tuning: An Overview of Recent Progress and Open Challenges

08/25/2023
by Tarique Siddiqui, et al.

The scale and complexity of workloads in modern cloud services have brought into sharper focus a critical challenge in automated index tuning – the need to recommend high-quality indexes while maintaining index tuning scalability. This challenge is further compounded by the requirement for automated index implementations to introduce minimal query performance regressions in production deployments, representing a significant barrier to achieving scalability and full automation. This paper directs attention to these challenges within automated index tuning and explores ways in which machine learning (ML) techniques provide new opportunities in their mitigation. In particular, we reflect on recent efforts in developing ML techniques for workload selection, candidate index filtering, speeding up index configuration search, reducing the number of query optimizer calls, and lowering the chances of performance regressions. We highlight the key takeaways from these efforts and underline the gaps that need to be closed for their effective functioning within the traditional index tuning framework. Additionally, we present a preliminary cross-platform design aimed at democratizing index tuning across multiple SQL-like systems – an imperative in today's continuously expanding data system landscape. We believe our findings will help provide context and impetus to the research and development efforts in automated index tuning.
