Optimal Resource Allocation for Serverless Queries

07/19/2021
by   Anish Pimpley, et al.
11

Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-data services. At the same time, it is incredibly hard for users to allocate resources per query in serverless processing systems, and they frequently misallocate by orders of magnitude. Unfortunately, prior work focused on predicting peak allocation while ignoring aggressive trade-offs between resource allocation and run-time. Additionally, these methods fail to predict allocation for queries that have not been observed in the past. In this paper, we tackle both these problems. We introduce a system for optimal resource allocation that can predict performance with aggressive trade-offs, for both new and past observed queries. We introduce the notion of a performance characteristic curve (PCC) as a parameterized representation that can compactly capture the relationship between resources and performance. To tackle training data sparsity, we introduce a novel data augmentation technique to efficiently synthesize the entire PCC using a single run of the query. Lastly, we demonstrate the advantages of a constrained loss function coupled with GNNs, over traditional ML methods, for capturing the domain specific behavior through an extensive experimental evaluation over SCOPE big data workloads at Microsoft.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2018

ALVEC: Auto-scaling by Lotka Volterra Elastic Cloud: A QoS aware Non Linear Dynamical Allocation Model

Elasticity in resource allocation is still a relevant problem in cloud c...
research
04/11/2020

Optimizing Cursor Loops in Relational Databases

Loops that iterate over SQL query results are quite common, both in appl...
research
09/08/2023

On the performance of an integrated communication and localization system: an analytical framework

Quantifying the performance bound of an integrated localization and comm...
research
04/07/2023

Runtime Variation in Big Data Analytics

The dynamic nature of resource allocation and runtime conditions on Clou...
research
07/11/2022

IaaS Procurement by Simulated Annealing

Considering the problem of resource allocation for potentially complex a...

Please sign up or login with your details

Forgot password? Click here to reset