High-Dimensional Bayesian Optimization with Multi-Task Learning for RocksDB

03/30/2021
by   Sami Alabed, et al.
0

RocksDB is a general-purpose embedded key-value store used in multiple different settings. Its versatility comes at the cost of complex tuning configurations. This paper investigates maximizing the throughput of RocksDB IO operations by auto-tuning ten parameters of varying ranges. Off-the-shelf optimizers struggle with high-dimensional problem spaces and require a large number of training samples. We propose two techniques to tackle this problem: multi-task modeling and dimensionality reduction through a manual grouping of parameters. By incorporating adjacent optimization in the model, the model converged faster and found complicated settings that other tuners could not find. This approach had an additional computational complexity overhead, which we mitigated by manually assigning parameters to each sub-goal through our knowledge of RocksDB. The model is then incorporated in a standard Bayesian Optimization loop to find parameters that maximize RocksDB's IO throughput. Our method achieved x1.3 improvement when benchmarked against a simulation of Facebook's social graph traffic, and converged in ten optimization steps compared to other state-of-the-art methods that required fifty steps.

READ FULL TEXT
research
06/22/2020

MUMBO: MUlti-task Max-value Bayesian Optimization

We propose MUMBO, the first high-performing yet computationally efficien...
research
02/27/2019

High-Dimensional Bayesian Optimization with Manifold Gaussian Processes

Bayesian optimization (BO) is a powerful approach for seeking the global...
research
06/04/2018

Efficient and Scalable Batch Bayesian Optimization Using K-Means

We present K-Means Batch Bayesian Optimization (KMBBO), a novel batch sa...
research
12/15/2018

Data-efficient Auto-tuning with Bayesian Optimization: An Industrial Control Study

Bayesian optimization is proposed for automatic learning of optimal cont...
research
08/18/2022

Bayesian Optimization Augmented with Actively Elicited Expert Knowledge

Bayesian optimization (BO) is a well-established method to optimize blac...
research
10/05/2020

Parameter Optimization using high-dimensional Bayesian Optimization

In this thesis, I explore the possibilities of conducting Bayesian optim...
research
02/03/2021

OmiEmbed: reconstruct comprehensive phenotypic information from multi-omics data using multi-task deep learning

High-dimensional omics data contains intrinsic biomedical information th...

Please sign up or login with your details

Forgot password? Click here to reset