A Lightweight Workload-Aware Microservices Autoscaling with QoS Assurance

01/31/2022
by   Md Rajib Hossen, et al.
0

Cloud applications are increasingly moving away from monolithic services to agile microservices-based deployments. However, efficient resource management for microservices poses a significant hurdle due to the sheer number of loosely coupled and interacting components. The interdependencies between various microservices make existing cloud resource autoscaling techniques ineffective. Meanwhile, machine learning (ML) based approaches that try to capture the complex relationships in microservices require extensive training data and cause intentional SLO violations. Moreover, these ML-heavy approaches are slow in adapting to dynamically changing microservice operating environments. In this paper, we propose PEMA (Practical Efficient Microservice Autoscaling), a lightweight microservice resource manager that finds efficient resource allocation through opportunistic resource reduction. PEMA's lightweight design enables novel workload-aware and adaptive resource management. Using three prototype microservice implementations, we show that PEMA can find close to optimum resource allocation and save up to 33 commercial rule-based resource allocations.

READ FULL TEXT
research
06/29/2021

workload forecasting and resource management models based on machine learning for cloud computing environments

The workload prediction and resource allocation significantly play an in...
research
05/18/2018

ALVEC: Auto-scaling by Lotka Volterra Elastic Cloud: A QoS aware Non Linear Dynamical Allocation Model

Elasticity in resource allocation is still a relevant problem in cloud c...
research
04/30/2020

A Survey on Time-Sensitive Resource Allocation in the Cloud Continuum

Artificial Intelligence (AI) and Internet of Things (IoT) applications a...
research
03/23/2022

Resource allocation optimization using artificial intelligence methods in various computing paradigms: A Review

With the advent of smart devices, the demand for various computational p...
research
04/10/2023

RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud Environments

Resource sharing between multiple workloads has become a prominent pract...
research
12/12/2021

Sinan: Data Driven Resource Management for Cloud Microservices

Cloud applications are increasingly shifting to interactive and loosely-...
research
04/05/2018

SARA: Self-Aware Resource Allocation for Heterogeneous MPSoCs

In modern heterogeneous MPSoCs, the management of shared memory resource...

Please sign up or login with your details

Forgot password? Click here to reset