RobustScaler: QoS-Aware Autoscaling for Complex Workloads

04/14/2022
by Huajie Qian, et al.

Autoscaling is a critical component for achieving efficient resource utilization with satisfactory quality of service (QoS) in cloud computing. This paper investigates proactive autoscaling for widely used scaling-per-query applications, where scaling is required for each query, such as container registries and function-as-a-service (FaaS). In these scenarios, the workload often exhibits high uncertainty with complex temporal patterns such as periodicity, noise, and outliers. Conservative strategies that scale out more instances than necessary lead to high resource costs, whereas aggressive strategies may result in poor QoS. We present RobustScaler to achieve a superior trade-off between cost and QoS. Specifically, we design a novel autoscaling framework based on non-homogeneous Poisson process (NHPP) modeling and stochastically constrained optimization. Furthermore, we develop a specialized alternating direction method of multipliers (ADMM) to efficiently train the NHPP model, and we rigorously prove the QoS guarantees delivered by our optimization-based proactive strategies. Extensive experiments show that RobustScaler outperforms common baseline autoscaling strategies on various real-world traces, by large margins for complex workload patterns.
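
To make the NHPP idea concrete, the following is a minimal sketch and not the method of the paper. It assumes a hypothetical sinusoidal arrival rate (`intensity`), samples query arrivals by thinning, and sizes a pool of pre-provisioned instances with a plain Poisson quantile so that the arrivals in the next window are covered with a chosen probability. All function names, the rate parameters, and the 95% target are illustrative assumptions. The quantile rule stands in for the stochastically constrained optimization, and the rate is given rather than trained with ADMM.

```python
import math
import random


def intensity(t, base=5.0, amplitude=3.0, period=24.0):
    """Hypothetical periodic arrival rate (queries per hour) at time t."""
    return base + amplitude * math.sin(2.0 * math.pi * t / period)


def sample_nhpp(rate_fn, t_end, rate_max, rng):
    """Sample NHPP arrival times on [0, t_end) by thinning.

    rate_max must be an upper bound on rate_fn over the interval.
    """
    arrivals = []
    t = 0.0
    while True:
        # Candidate arrivals come from a homogeneous process of rate rate_max.
        t += rng.expovariate(rate_max)
        if t >= t_end:
            return arrivals
        # Keep each candidate with probability rate_fn(t) / rate_max.
        if rng.random() * rate_max <= rate_fn(t):
            arrivals.append(t)


def expected_count(rate_fn, start, end, steps=1000):
    """Integrate the rate over [start, end] with the midpoint rule."""
    h = (end - start) / steps
    return sum(rate_fn(start + (i + 0.5) * h) for i in range(steps)) * h


def poisson_quantile(mean, level):
    """Smallest k with P(N <= k) >= level for N ~ Poisson(mean)."""
    k = 0
    pmf = math.exp(-mean)
    cdf = pmf
    while cdf < level:
        k += 1
        pmf *= mean / k
        cdf += pmf
    return k


# Illustrative use: one simulated day, then size the pool for the first hour.
rng = random.Random(0)
arrivals = sample_nhpp(intensity, 24.0, 8.0, rng)
mean_next_hour = expected_count(intensity, 0.0, 1.0)
instances = poisson_quantile(mean_next_hour, 0.95)
```

Raising the target level increases `instances` and therefore cost, while lowering it risks queries arriving with no instance ready. This is the cost versus QoS trade-off described in the abstract, here in a deliberately simplified form.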
