NoSQL Database Tuning through Machine Learning

12/23/2022
by   Florian Eppinger, et al.
0

NoSQL databases have become an important component of many big data and real-time web applications. Their distributed nature and scalability make them an ideal data storage repository for a variety of use cases. While NoSQL databases are delivered with a default ”off-the-shelf” configuration, they offer configuration settings to adjust a database's behavior and performance to a specific use case and environment. The abundance and oftentimes imperceptible inter-dependencies of configuration settings make it difficult to optimize and performance-tune a NoSQL system. There is no one-size-fits-all configuration and therefore the workload, the physical design, and available resources need to be taken into account when optimizing the configuration of a NoSQL database. This work explores Machine Learning as a means to automatically tune a NoSQL database for optimal performance. Using Random Forest and Gradient Boosting Decision Tree Machine Learning algorithms, multiple Machine Learning models were fitted with a training dataset that incorporates properties of the NoSQL physical configuration (replication and sharding). The best models were then employed as surrogate models to optimize the Database Management System's configuration settings for throughput and latency using a Black-box Optimization algorithm. Using an Apache Cassandra database, multiple experiments were carried out to demonstrate the feasibility of this approach, even across varying physical configurations. The tuned DBMS configurations yielded throughput improvements of up to 4 43 configuration settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2020

Sapphire: Automatic Configuration Recommendation for Distributed Storage Systems

Modern distributed storage systems come with aplethora of configurable p...
research
01/21/2019

Predictive Indexing

There has been considerable research on automated index tuning in databa...
research
07/30/2023

IWEK: An Interpretable What-If Estimator for Database Knobs

The knobs of modern database management systems have significant impact ...
research
04/25/2023

Deep learning based Auto Tuning for Database Management System

The management of database system configurations is a challenging task, ...
research
07/11/2017

Hot-Rodding the Browser Engine: Automatic Configuration of JavaScript Compilers

Modern software systems in many application areas offer to the user a mu...
research
06/25/2023

Utilizing deep learning for automated tuning of database management systems

Managing the configurations of a database system poses significant chall...
research
08/31/2018

Autonomous Configuration of Network Parameters in Operating Systems using Evolutionary Algorithms

By default, the Linux network stack is not configured for highspeed larg...

Please sign up or login with your details

Forgot password? Click here to reset