Wasserstein Adversarial Transformer for Cloud Workload Prediction

03/12/2022
by Shivani Arbat, et al.

Predictive Virtual Machine (VM) auto-scaling is a promising technique for optimizing the operating costs and performance of cloud applications. Understanding the job arrival rate is crucial for accurately predicting future changes in cloud workloads and for proactively provisioning and de-provisioning the VMs that host the applications. However, developing a model that accurately predicts cloud workload changes is extremely challenging because cloud workloads are highly dynamic. Long Short-Term Memory (LSTM) models have been developed for cloud workload prediction. Unfortunately, the state-of-the-art LSTM model relies on recurrence to make predictions, which adds complexity and increases inference overhead as input sequences grow longer. To develop a cloud workload prediction model with high accuracy and low inference overhead, this work presents a novel time-series forecasting model called WGAN-gp Transformer, inspired by the Transformer network and improved Wasserstein GANs. The proposed method adopts a Transformer network as the generator and a multi-layer perceptron as the critic. Extensive evaluations with real-world workload traces show that WGAN-gp Transformer achieves 5 times faster inference with up to 5.1 percent higher prediction accuracy than the state-of-the-art approach. We also apply WGAN-gp Transformer to auto-scaling mechanisms on Google cloud platforms, and the WGAN-gp Transformer-based auto-scaling mechanism outperforms the LSTM-based mechanism by significantly reducing VM over-provisioning and under-provisioning rates.
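To make the critic side of the WGAN-gp objective concrete, the following is a minimal pure-Python sketch of the critic loss with gradient penalty as defined for improved Wasserstein GANs. It is an illustration under stated assumptions, not the authors' implementation: the one-hidden-layer critic, the function names (`critic`, `critic_input_grad`, `critic_loss`), and the penalty weight `lam=10.0` (the default suggested for improved Wasserstein GANs) are all hypothetical choices, and `fake_batch` simply stands in for forecasts that a Transformer generator would produce.

```python
import math
import random


def _hidden(x, W, b):
    """Hidden activations tanh(Wx + b) of the toy critic."""
    return [math.tanh(sum(w * xj for w, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]


def critic(x, W, b, v, c):
    """Score a workload window x with a tiny MLP: v . tanh(Wx + b) + c."""
    return sum(vi * hi for vi, hi in zip(v, _hidden(x, W, b))) + c


def critic_input_grad(x, W, b, v):
    """Analytic gradient of the critic score with respect to the input x."""
    h = _hidden(x, W, b)
    scale = [vi * (1.0 - hi * hi) for vi, hi in zip(v, h)]  # d tanh = 1 - tanh^2
    return [sum(scale[i] * W[i][j] for i in range(len(W)))
            for j in range(len(x))]


def critic_loss(real_batch, fake_batch, params, lam=10.0, rng=random):
    """WGAN-gp critic loss: E[D(fake)] - E[D(real)] + lam * gradient penalty.

    Assumes real_batch and fake_batch have the same length.
    """
    W, b, v, c = params
    n = len(real_batch)
    wasserstein = (sum(critic(f, W, b, v, c) for f in fake_batch)
                   - sum(critic(r, W, b, v, c) for r in real_batch)) / n
    penalty = 0.0
    for r, f in zip(real_batch, fake_batch):
        eps = rng.random()  # random interpolation point between real and fake
        x_hat = [eps * rj + (1.0 - eps) * fj for rj, fj in zip(r, f)]
        grad = critic_input_grad(x_hat, W, b, v)
        grad_norm = math.sqrt(sum(g * g for g in grad))
        penalty += (grad_norm - 1.0) ** 2  # push the gradient norm toward 1
    return wasserstein + lam * penalty / n
```

In a real training loop this loss would be minimized with respect to the critic parameters while the generator is updated to raise the critic's score on its forecasts; an automatic-differentiation framework would normally replace the hand-written input gradient used here.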


