Lynceus: Tuning and Provisioning Data Analytic Jobs on a Budget

05/06/2019
by Maria Casimiro, et al.

Many enterprises need to run data analytic jobs in the cloud. Significant cost savings can be achieved by finding the right combination of virtual machine type, cluster size, and parameter settings for the job (e.g., the hyper-parameters of a machine learning algorithm). Unfortunately, this task is very challenging because the search space comprises hundreds or even thousands of configurations. Lynceus is a new tool that automatically provisions and tunes data analytic applications in the cloud in a cost-efficient manner. It implements a new budget-aware approach that builds the performance model of the target job by profiling the job on the best possible set of cloud/parameter configurations, subject to both quality-of-service and monetary constraints. Lynceus departs from state-of-the-art approaches, which simply aim to reduce the number of configurations to try and disregard the corresponding profiling costs, and which therefore achieve a worse trade-off between the accuracy of the model and the cost of building it. We evaluate Lynceus on several heterogeneous data analytic jobs, running on different frameworks and with search spaces of different sizes. We compare Lynceus with the state-of-the-art approach, implemented by recent systems such as CherryPick, and show that it consistently identifies better (i.e., less expensive) configurations. This leads to cost reductions that range from 1.7x to 1.9x on average, and from 2x to 4x at the 90th percentile.
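To make the problem setting concrete, the following is a minimal sketch of a budget-constrained configuration search. It is not Lynceus's algorithm, which the abstract does not describe in detail; it is a simple greedy stand-in. It only illustrates the ingredients the abstract names: a search space of VM type, cluster size, and job parameter; a monetary cost for each profiling run; a quality-of-service limit (here a runtime deadline); and a profiling budget. All names, prices, speeds, and the `synthetic_runtime_hours` function are hypothetical placeholders for real measurements.

```python
import itertools

# All numbers below are made-up illustrative values, not measurements.
VM_PRICE_PER_HOUR = {"small": 0.10, "medium": 0.20, "large": 0.40}  # $ per node-hour
VM_SPEED = {"small": 1.0, "medium": 2.0, "large": 3.5}              # relative speed
CLUSTER_SIZES = [2, 4, 8]
BATCH_SIZES = [64, 256]  # stands in for any tunable job parameter


def search_space():
    """Enumerate every (vm_type, cluster_size, batch_size) configuration."""
    return list(itertools.product(VM_PRICE_PER_HOUR, CLUSTER_SIZES, BATCH_SIZES))


def synthetic_runtime_hours(config):
    """Hypothetical stand-in for running the job; a real tool would measure it."""
    vm, nodes, batch = config
    work = 8.0 * (64 / batch) ** 0.25
    return work / (VM_SPEED[vm] * nodes ** 0.8)


def hourly_price(config):
    """Price of the whole cluster per hour."""
    vm, nodes, _ = config
    return VM_PRICE_PER_HOUR[vm] * nodes


def budgeted_search(budget, max_runtime_hours):
    """Profile configurations, cheapest worst case first, without overspending."""
    # Worst case for one profiling run: the job is killed at the deadline.
    candidates = sorted(
        search_space(), key=lambda c: hourly_price(c) * max_runtime_hours
    )
    spent, best, profiled = 0.0, None, []
    for config in candidates:
        worst_case = hourly_price(config) * max_runtime_hours
        if spent + worst_case > budget:
            break  # all remaining candidates are at least as costly to profile
        runtime = synthetic_runtime_hours(config)
        cost = hourly_price(config) * min(runtime, max_runtime_hours)
        spent += cost
        feasible = runtime <= max_runtime_hours
        profiled.append((config, runtime, cost, feasible))
        # Keep the cheapest configuration that meets the deadline.
        if feasible and (best is None or cost < best[2]):
            best = (config, runtime, cost)
    return {"best": best, "spent": spent, "profiled": profiled}


if __name__ == "__main__":
    result = budgeted_search(budget=10.0, max_runtime_hours=2.0)
    print("best:", result["best"])
    print("spent on profiling:", round(result["spent"], 3))
```

Even this toy search space has 18 configurations, and the budget runs out before all of them can be profiled. This is the situation in which the choice of what to profile, and not only how many configurations to try, starts to matter.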

research
03/12/2023

Scavenger: A Cloud Service for Optimizing Cost and Performance of ML Training

While the pay-as-you-go nature of cloud virtual machines (VMs) makes it ...
research
02/22/2021

Characterizing and Optimizing EDA Flows for the Cloud

Cloud computing accelerates design space exploration in logic synthesis,...
research
11/09/2020

TrimTuner: Efficient Optimization of Machine Learning Jobs in the Cloud via Sub-Sampling

This work introduces TrimTuner, the first system for optimizing machine ...
research
11/08/2022

Ruya: Memory-Aware Iterative Optimization of Cluster Configurations for Big Data Processing

Selecting appropriate computational resources for data processing jobs o...
research
03/04/2019

Workflow Scheduling in the Cloud with Weighted Upward-rank Priority Scheme Using Random Walk and Uniform Spare Budget Splitting

We study a difficult problem of how to schedule complex workflows with p...
research
02/17/2023

CarbonScaler: Leveraging Cloud Workload Elasticity for Optimizing Carbon-Efficiency

Cloud platforms are increasingly emphasizing sustainable operations in o...
research
02/01/2019

Hyper-parameter Tuning under a Budget Constraint

We study a budgeted hyper-parameter tuning problem, where we optimize th...
