Heterogeneous Computation Assignments in Coded Elastic Computing

01/12/2020
by Nicholas Woolsey, et al.

We study the optimal design of a heterogeneous coded elastic computing (CEC) network in which machines have varying relative computation speeds. CEC, introduced by Yang et al., is a framework that mitigates the impact of elastic events, in which machines join and leave the network. A data set is distributed among storage-constrained machines using a Maximum Distance Separable (MDS) code such that any subset of machines of a specific size can perform the desired computations. This design eliminates the need to redistribute the data after each elastic event. In this work, we develop a process by which an arbitrary heterogeneous computing network minimizes the overall computation time by defining an optimal computation load, that is, the number of computations assigned to each machine. We then present an algorithm to define a specific computation assignment among the machines that makes use of the MDS code and meets the optimal computation load.
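To make the idea of a computation load concrete, the following is a minimal sketch under assumptions that are not stated in the abstract. It assumes each machine n has a relative speed `speeds[n]` and is assigned a load `mu[n]` between 0 and 1 (the fraction of its stored coded data it processes), that the loads must sum to `L` (the hypothetical subset size needed to recover the result), and that machine n finishes in time `mu[n] / speeds[n]`. The function name `optimal_loads` and this model are illustrative only and may differ from the formulation in the full paper.

```python
def optimal_loads(speeds, L):
    """Sketch: choose loads mu[n] in [0, 1] with sum(mu) == L that
    minimize max(mu[n] / speeds[n]) under the assumed model.

    Returns (loads, completion_time).
    """
    n = len(speeds)
    if any(s <= 0 for s in speeds):
        raise ValueError("speeds must be positive")
    if not 0 < L <= n:
        raise ValueError("L must satisfy 0 < L <= number of machines")

    # Visit machines from fastest to slowest. A machine whose proportional
    # share would exceed 1 is capped at load 1, and the remaining work is
    # split among the slower machines in proportion to their speeds.
    order = sorted(range(n), key=lambda i: speeds[i], reverse=True)
    loads = [0.0] * n
    remaining_speed = float(sum(speeds))
    tol = 1e-12  # guards against floating-point round-off

    for k, i in enumerate(order):
        # Common finishing time if the k fastest machines are capped at 1.
        c = (L - k) / remaining_speed
        if c * speeds[i] <= 1.0 + tol:
            # The fastest uncapped machine fits, so all slower ones do too.
            for j in order[k:]:
                loads[j] = min(1.0, c * speeds[j])
            return loads, c
        loads[i] = 1.0
        remaining_speed -= speeds[i]

    # Reached only through round-off when L == n: every machine is capped.
    return loads, 1.0 / min(speeds)


if __name__ == "__main__":
    loads, t = optimal_loads([4, 2, 1, 1], L=2)
    print(loads, t)
```

In this sketch, uncapped machines all finish at the same time `c`, while capped machines finish earlier because they run out of stored data before they run out of time. Mapping these loads to a specific assignment of computations across machines is a separate step that the sketch does not attempt.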

