Magpie: Automatically Tuning Static Parameters for Distributed File Systems using Deep Reinforcement Learning

07/19/2022
by   Houkun Zhu, et al.
0

Distributed file systems are widely used nowadays, yet using their default configurations is often not optimal. At the same time, tuning configuration parameters is typically challenging and time-consuming. It demands expertise and tuning operations can also be expensive. This is especially the case for static parameters, where changes take effect only after a restart of the system or workloads. We propose a novel approach, Magpie, which utilizes deep reinforcement learning to tune static parameters by strategically exploring and exploiting configuration parameter spaces. To boost the tuning of the static parameters, our method employs both server and client metrics of distributed file systems to understand the relationship between static parameters and performance. Our empirical evaluation results show that Magpie can noticeably improve the performance of the distributed file system Lustre, where our approach on average achieves 91.8 configuration after tuning towards single performance indicator optimization, while it reaches 39.7

READ FULL TEXT

page 1

page 2

page 7

page 8

research
02/17/2023

DMSConfig: Automated Configuration Tuning for Distributed IoT Message Systems Using Deep Reinforcement Learning

The Distributed Messaging Systems (DMSs) used in IoT systems require tim...
research
01/16/2023

IOPathTune: Adaptive Online Parameter Tuning for Parallel File System I/O Path

Parallel file systems contain complicated I/O paths from clients to stor...
research
10/10/2017

BestConfig: Tapping the Performance Potential of Systems via Automatic Configuration Tuning

An ever increasing number of configuration parameters are provided to sy...
research
09/14/2018

Auto-tuning Distributed Stream Processing Systems using Reinforcement Learning

Fine tuning distributed systems is considered to be a craftsmanship, rel...
research
01/17/2018

The Case for Automatic Database Administration using Deep Reinforcement Learning

Like any large software system, a full-fledged DBMS offers an overwhelmi...
research
08/31/2018

Autonomous Configuration of Network Parameters in Operating Systems using Evolutionary Algorithms

By default, the Linux network stack is not configured for highspeed larg...
research
07/07/2020

Sapphire: Automatic Configuration Recommendation for Distributed Storage Systems

Modern distributed storage systems come with aplethora of configurable p...

Please sign up or login with your details

Forgot password? Click here to reset