Seagull: An Infrastructure for Load Prediction and Optimized Resource Allocation

09/27/2020
by   Olga Poppe, et al.
0

Microsoft Azure is dedicated to guarantee high quality of service to its customers, in particular, during periods of high customer activity, while controlling cost. We employ a Data Science (DS) driven solution to predict user load and leverage these predictions to optimize resource allocation. To this end, we built the Seagull infrastructure that processes per-server telemetry, validates the data, trains and deploys ML models. The models are used to predict customer load per server (24h into the future), and optimize service operations. Seagull continually re-evaluates accuracy of predictions, fallback to previously known good models and triggers alerts as appropriate. We deployed this infrastructure in production for PostgreSQL and MySQL servers across all Azure regions, and applied it to the problem of scheduling server backups during low-load time. This minimizes interference with user-induced load and improves customer experience.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2018

The Serverless Scheduling Problem and NOAH

The serverless scheduling problem poses a new challenge to Cloud service...
research
09/27/2019

Improving Resource Allocation in Software-Defined Networks using Clustering

Software-defined networks (SDNs) are a huge evolution in simplifying imp...
research
01/11/2022

Performance Evaluation of Stochastic Bipartite Matching Models

We consider a stochastic bipartite matching model consisting of multi-cl...
research
04/24/2023

Customized Load Profiles Synthesis for Electricity Customers Based on Conditional Diffusion Models

Customers' load profiles are critical resources to support data analytic...
research
01/11/2021

System Design for a Data-driven and Explainable Customer Sentiment Monitor

The most important goal of customer services is to keep the customer sat...
research
08/27/2020

Propensity-to-Pay: Machine Learning for Estimating Prediction Uncertainty

Predicting a customer's propensity-to-pay at an early point in the reven...
research
09/04/2015

Predicting SLA Violations in Real Time using Online Machine Learning

Detecting faults and SLA violations in a timely manner is critical for t...

Please sign up or login with your details

Forgot password? Click here to reset