FaaSched: A Jitter-Aware Serverless Scheduler

03/11/2023
by   Abhisek Panda, et al.
0

Serverless computing systems are becoming very popular. Large corporations such as Netflix, Airbnb, and Coca-Cola use such systems for running their websites and IT systems. The advantages of such systems include superior support for auto-scaling, load balancing, and fast distributed processing. These are multi-QoS systems where different classes of applications have different latency and jitter (variation in the latency) requirements: we consider a mix of latency-sensitive (LS) and latency-desirable (LD) applications. Ensuring proper schedulability and QoS enforcement of LS applications is non-trivial. We need to minimize the jitter without increasing the response latency of LS applications, and we also need to keep the degradation of the response latency of LD applications in check. This is the first paper in this domain that achieves a trade-off between the jitter suffered by LS applications and the response latency of LD applications. We minimize the former with a bound on the latter using a reinforcement learning (RL) based scheme. To design such an RL scheme, we performed detailed characterization studies to find the input variables of interest, defined novel state representations, and proposed a bespoke reward function that allows us to achieve this trade-off. For an aggressive use case comprising five popular LS and LD applications each, we show a reduction in response time variance and mean latency of 50.31 degradation in the execution latency of LD applications was limited to 19.88

READ FULL TEXT

page 5

page 6

page 10

page 11

research
03/22/2023

Communication Load Balancing via Efficient Inverse Reinforcement Learning

Communication load balancing aims to balance the load between different ...
research
02/04/2022

A Reinforcement Learning Framework for PQoS in a Teleoperated Driving Scenario

In recent years, autonomous networks have been designed with Predictive ...
research
06/15/2023

Generalizable Resource Scaling of 5G Slices using Constrained Reinforcement Learning

Network slicing is a key enabler for 5G to support various applications....
research
02/22/2023

Towards Decentralized Predictive Quality of Service in Next-Generation Vehicular Networks

To ensure safety in teleoperated driving scenarios, communication betwee...
research
05/11/2022

Statistical Characterization of Closed-Loop Latency at the Mobile Edge

The stringent timing and reliability requirements in mission-critical ap...
research
11/10/2022

A Graph Neural Networks based Framework for Topology-Aware Proactive SLA Management in a Latency Critical NFV Application Use-case

Recent advancements in the rollout of 5G and 6G have led to the emergenc...
research
08/19/2016

POLYPATH: Supporting Multiple Tradeoffs for Interaction Latency

Modern mobile systems use a single input-to-display path to serve all ap...

Please sign up or login with your details

Forgot password? Click here to reset