Pronto: Federated Task Scheduling

04/27/2021
by   Andreas Grammenos, et al.
0

We present a federated, asynchronous, memory-limited algorithm for online task scheduling across large-scale networks of hundreds of workers. This is achieved through recent advancements in federated edge computing that unlocks the ability to incrementally compute local model updates within each node separately. This local model is then used along with incoming data to generate a rejection signal which reflects the overall node responsiveness and if it is able to accept an incoming task without resulting in degraded performance. Through this innovation, we allow each node to execute scheduling decisions on whether to accept an incoming job independently based on the workload seen thus far. Further, using the aggregate of the iterates a global view of the system can be constructed, as needed, and could be used to produce a holistic perspective of the system. We complement our findings, by an empirical evaluation on a large-scale real-world dataset of traces from a virtualized production data center that shows, while using limited memory, that our algorithm exhibits state-of-the-art performance. Concretely, it is able to predict changes in the system responsiveness ahead of time based on the industry-standard CPU-Ready metric and, in turn, can lead to better scheduling decisions and overall utilization of the available resources. Finally, in the absence of communication latency, it exhibits attractive horizontal scalability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2019

Federated PCA with Adaptive Rank Estimation

In many online machine learning and data science tasks such as data summ...
research
08/20/2023

Eventually-Consistent Federated Scheduling for Data Center Workloads

Data center schedulers operate at unprecedented scales today to accommod...
research
01/15/2021

CPU Scheduling in Data Centers Using Asynchronous Finite-Time Distributed Coordination Mechanisms

We propose an asynchronous iterative scheme which allows a set of interc...
research
04/01/2020

Scheduling in Cellular Federated Edge Learning with Importance and Channel Awareness

In cellular federated edge learning (FEEL), multiple edge devices holdin...
research
03/29/2023

An Efficient Online Prediction of Host Workloads Using Pruned GRU Neural Nets

Host load prediction is essential for dynamic resource scaling and job s...
research
04/07/2021

Optimal CPU Scheduling in Data Centers via a Finite-Time Distributed Quantized Coordination Mechanism

In this paper we analyze the problem of optimal task scheduling for data...
research
07/04/2021

Towards Scheduling Federated Deep Learning using Meta-Gradients for Inter-Hospital Learning

Given the abundance and ease of access of personal data today, individua...

Please sign up or login with your details

Forgot password? Click here to reset