Scout: An Experienced Guide to Find the Best Cloud Configuration

03/04/2018
by   Chin-Jung Hsu, et al.
0

Finding the right cloud configuration for workloads is an essential step to ensure good performance and contain running costs. A poor choice of cloud configuration decreases application performance and increases running cost significantly. While Bayesian Optimization is effective and applicable to any workloads, it is fragile because performance and workload are hard to model (to predict). In this paper, we propose a novel method, SCOUT. The central insight of SCOUT is that using prior measurements, even those for different workloads, improves search performance and reduces search cost. At its core, SCOUT extracts search hints (inference of resource requirements) from low-level performance metrics. Such hints enable SCOUT to navigate through the search space more efficiently---only spotlight region will be searched. We evaluate SCOUT with 107 workloads on Apache Hadoop and Spark. The experimental results demonstrate that our approach finds better cloud configurations with a lower search cost than state of the art methods. Based on this work, we conclude that (i) low-level performance information is necessary for finding the right cloud configuration in an effective, efficient and reliable way, and (ii) a search method can be guided by historical data, thereby reducing cost and improving performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2017

Low-Level Augmented Bayesian Optimization for Finding the Best Cloud VM

With the advent of big data applications, which tends to have longer exe...
research
06/28/2020

Fast and Low-cost Search for Efficient Cloud Configurations for HPC Workloads

The use of cloud computational resources has become increasingly importa...
research
03/15/2018

Micky: A Cheaper Alternative for Selecting Cloud Instances

Most cloud computing optimizers explore and improve one workload at a ti...
research
04/11/2022

Cello: Efficient Computer Systems Optimization with Predictive Early Termination and Censored Regression

Sample-efficient machine learning (SEML) has been widely applied to find...
research
04/20/2022

Search-based Methods for Multi-Cloud Configuration

Multi-cloud computing has become increasingly popular with enterprises l...
research
06/30/2022

Exploiting Inherent Elasticity of Serverless in Irregular Algorithms

Serverless computing, in particular the Function-as-a-Service (FaaS) exe...
research
12/16/2017

StackInsights: Cognitive Learning for Hybrid Cloud Readiness

Hybrid cloud is an integrated cloud computing environment utilizing a mi...

Please sign up or login with your details

Forgot password? Click here to reset