Runtime QoS service for application-driven adaptation in network computing

07/30/2019
by   Feras Al-Hawari, et al.

A distributed application executing on a Network of Workstations (NOW) must be aware of resource state so that it can adapt itself and keep satisfying its Quality of Service (QoS) demands throughout its lifespan. We implemented a QoS service that enables application-driven adaptation for performance and fault tolerance at runtime. The service is backed by lightweight middleware that monitors the state and load of all application entities (e.g., machines, tasks, and logical network links), and it exposes its functionality to application tasks through an anonymous, simple-to-use QoS API. We present a Manager-Worker application that uses our fault tolerance QoS API to adapt to Worker faults and thereby avoid application deadlock at runtime. We also show how a dynamic application-level scheduler can use the QoS API to find efficient schedules. Finally, we quantified the overhead of the QoS middleware in various scenarios to demonstrate that it has only a minor impact on the performance of the application it serves.
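
The Manager-Worker adaptation described above can be illustrated with a minimal sketch. It is not the paper's API: `FakeQoSService`, `is_alive`, and `run_manager` are hypothetical names introduced here only for illustration, and the fault model (a fixed set of failed Workers) is an assumption. The point it shows is that a Manager which asks a QoS service about Worker state can reassign a task instead of waiting indefinitely on a failed Worker.

```python
from collections import deque


class FakeQoSService:
    """Hypothetical stand-in for a QoS service (not the paper's API)."""

    def __init__(self, failed_workers):
        # Assumption: faults are modeled as a fixed set of failed workers.
        self._failed = set(failed_workers)

    def is_alive(self, worker):
        """Report whether the given worker is currently considered healthy."""
        return worker not in self._failed


def run_manager(tasks, workers, qos):
    """Assign tasks round-robin and reassign any task whose worker has failed.

    Returns a dict mapping each task to the worker that completed it.
    Raises RuntimeError when no live worker remains, rather than blocking.
    """
    pool = deque(workers)
    pending = deque(tasks)
    done = {}
    while pending:
        if not pool:
            raise RuntimeError("no live workers left")
        task = pending.popleft()
        worker = pool[0]
        pool.rotate(-1)  # move the chosen worker to the back of the pool
        if qos.is_alive(worker):
            done[task] = worker  # simulated successful completion
        else:
            pool.remove(worker)       # stop scheduling on the failed worker
            pending.appendleft(task)  # retry the same task on another worker
    return done


schedule = run_manager(
    ["t1", "t2", "t3", "t4"],
    ["w1", "w2", "w3"],
    FakeQoSService(failed_workers={"w2"}),
)
print(schedule)
```

In this sketch the Manager only learns of a fault by polling the service after an assignment; a real service might instead notify the application, which the abstract does not specify.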
