Auto-tuning Distributed Stream Processing Systems using Reinforcement Learning

09/14/2018
by Luis M. Vaquero, et al.

Fine-tuning distributed systems is considered a craft that relies on intuition and experience. It becomes even more challenging when the systems must react in near real time, as streaming engines must in order to maintain pre-agreed service quality metrics. In this article, we present an automated approach that combines supervised and reinforcement learning methods to recommend the most appropriate lever configurations based on previous load. With this approach, streaming engines can be tuned automatically, without requiring a human to determine the right way and proper time to deploy those configurations. This opens the door to new configurations that are not applied today because the complexity of managing these systems has surpassed the abilities of human experts. We show how reinforcement learning systems can find substantially better configurations in less time than their human counterparts and adapt to changing workloads.

07/19/2022
Magpie: Automatically Tuning Static Parameters for Distributed File Systems using Deep Reinforcement Learning
Distributed file systems are widely used nowadays, yet using their defau...

06/20/2022
Phoebe: QoS-Aware Distributed Stream Processing through Anticipating Dynamic Workloads
Distributed Stream Processing systems have become an essential part of b...

04/05/2021
UDO: Universal Database Optimization using Reinforcement Learning
UDO is a versatile tool for offline tuning of database systems for speci...

04/25/2023
Deep learning based Auto Tuning for Database Management System
The management of database system configurations is a challenging task, ...

04/21/2020
Efficient Adaptation for End-to-End Vision-Based Robotic Manipulation
One of the great promises of robot learning systems is that they will be...

08/16/2020
In-situ Workflow Auto-tuning via Combining Performance Models of Component Applications
In-situ parallel workflows couple multiple component applications, such ...

04/10/2023
Improving ABR Performance for Short Video Streaming Using Multi-Agent Reinforcement Learning with Expert Guidance
In the realm of short video streaming, popular adaptive bitrate (ABR) al...