A Deep Reinforcement Learning based Algorithm for Time and Cost Optimized Scaling of Serverless Applications

08/22/2023
by   Anupama Mampage, et al.
0

Serverless computing has gained a strong traction in the cloud computing community in recent years. Among the many benefits of this novel computing model, the rapid auto-scaling capability of user applications takes prominence. However, the offer of adhoc scaling of user deployments at function level introduces many complications to serverless systems. The added delay and failures in function request executions caused by the time consumed for dynamically creating new resources to suit function workloads, known as the cold-start delay, is one such very prevalent shortcoming. Maintaining idle resource pools to alleviate this issue often results in wasted resources from the cloud provider perspective. Existing solutions to address this limitation mostly focus on predicting and understanding function load levels in order to proactively create required resources. Although these solutions improve function performance, the lack of understanding on the overall system characteristics in making these scaling decisions often leads to the sub-optimal usage of system resources. Further, the multi-tenant nature of serverless systems requires a scalable solution adaptable for multiple co-existing applications, a limitation seen in most current solutions. In this paper, we introduce a novel multi-agent Deep Reinforcement Learning based intelligent solution for both horizontal and vertical scaling of function resources, based on a comprehensive understanding on both function and system requirements. Our solution elevates function performance reducing cold starts, while also offering the flexibility for optimizing resource maintenance cost to the service providers. Experiments conducted considering varying workload scenarios show improvements of up to 23 latency and request failures, while also saving up to 45 cost for the service providers.

READ FULL TEXT

page 1

page 12

page 13

page 15

research
05/29/2020

AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments

Serverless computing has emerged as a compelling new paradigm of cloud c...
research
05/28/2021

Performance Evaluation of Snapshot Methods to Warm the Serverless Cold Start

The serverless computing model strengthens the cloud computing tendency ...
research
04/29/2022

Cost Effective MLaaS Federation: A Combinatorial Reinforcement Learning Approach

With the advancement of deep learning techniques, major cloud providers ...
research
08/15/2023

Reinforcement Learning (RL) Augmented Cold Start Frequency Reduction in Serverless Computing

Function-as-a-Service is a cloud computing paradigm offering an event-dr...
research
01/14/2021

Impact of Distributed Rate Limiting on Load Distribution in a Latency-sensitive Messaging Service

The cloud's flexibility and promise of seamless auto-scaling notwithstan...
research
12/14/2020

WISE: A Computer System Performance Index Scoring Framework

The performance levels of a computing machine running a given workload c...

Please sign up or login with your details

Forgot password? Click here to reset