Optimized Memoryless Fair-Share HPC Resources Scheduling using Transparent Checkpoint-Restart Preemption

02/25/2021
by   Kfir Zvi, et al.
0

Common resource management methods in supercomputing systems usually include hard divisions, capping, and quota allotment. Those methods, despite their 'advantages', have some known serious disadvantages including unoptimized utilization of an expensive facility, and occasionally there is still a need to dynamically reschedule and reallocate the resources. Consequently, those methods involve bad supply-and-demand management rather than a free market playground that will eventually increase system utilization and productivity. In this work, we propose the newly Optimized Memoryless Fair-Share HPC Resources Scheduling using Transparent Checkpoint-Restart Preemption, in which the social welfare increases using a free-of-cost interchangeable proprietary possession scheme. Accordingly, we permanently keep the status-quo in regard to the fairness of the resources distribution while maximizing the ability of all users to achieve more CPUs and CPU hours for longer period without any non-straightforward costs, penalties or additional human intervention.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2020

Scheduling Beyond CPUs for HPC

High performance computing (HPC) is undergoing significant changes. The ...
research
05/21/2019

Exploring the Fairness and Resource Distribution in an Apache Mesos Environment

Apache Mesos, a cluster-wide resource manager, is widely deployed in mas...
research
03/10/2021

A Resourceful Coordination Approach for Multilevel Scheduling

HPC users aim to improve their execution times without particular regard...
research
03/16/2021

Intelligent colocation of HPC workloads

Many HPC applications suffer from a bottleneck in the shared caches, ins...
research
10/11/2022

Fair and Efficient Multi-Resource Allocation for Cloud Computing

We study the problem of allocating multiple types of resources to agents...
research
11/27/2019

Dynamically Provisioning Cray DataWarp Storage

Complex applications and workflows needs are often exclusively expressed...
research
08/03/2019

An Optimized Disk Scheduling Algorithm With Bad-Sector Management

In high performance computing, researchers try to optimize the CPU Sched...

Please sign up or login with your details

Forgot password? Click here to reset