Coded Elastic Computing on Machines with Heterogeneous Storage and Computation Speed

08/12/2020
by Nicholas Woolsey, et al.

We study the optimal design of heterogeneous Coded Elastic Computing (CEC), where machines have varying computation speeds and storage. CEC, introduced by Yang et al. in 2018, is a framework that mitigates the impact of elastic events, in which machines can join and leave at arbitrary times. In CEC, data is distributed among machines using a Maximum Distance Separable (MDS) code such that subsets of machines can perform the desired computations. However, state-of-the-art CEC designs only operate on homogeneous networks, where all machines have the same speed and storage, an assumption that may not hold in practice. In this work, based on an MDS storage assignment, we develop a novel computation assignment approach for heterogeneous CEC networks to minimize the overall computation time. We first consider the scenario where machines have heterogeneous computing speeds but the same storage, and then the scenario where both heterogeneities are present. We propose a novel combinatorial optimization formulation and solve it exactly by decomposing it into a convex optimization problem for finding the optimal computation load and a "filling problem" for finding the exact computation assignment. A low-complexity "filling algorithm" is adapted to solve the latter, and it completes within a number of iterations at most equal to the number of available machines.
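The first step of the decomposition, finding the computation load, can be illustrated with a small sketch. The abstract does not give the formulation, so everything below is an assumption made for illustration: each machine n has a speed s_n and a load mu_n between 0 and 1 (the fraction of the task it computes), the loads must sum to a recovery threshold L (each piece of work is covered by L machines), and the goal is to minimize the largest mu_n / s_n. The function name `optimal_loads` and its parameters are hypothetical, and the water-filling procedure is a generic way to solve this assumed problem, not necessarily the authors' method. The "filling problem" is not covered here.

```python
def optimal_loads(speeds, L):
    """Water-filling sketch for an ASSUMED load problem (not from the abstract):
    minimize max_n mu[n] / speeds[n]
    subject to sum(mu) == L and 0 <= mu[n] <= 1.
    Returns (loads, finish_time)."""
    n = len(speeds)
    if not 1 <= L <= n or min(speeds) <= 0:
        raise ValueError("need 1 <= L <= len(speeds) and positive speeds")

    # Fastest machines first: they are the first to hit the cap mu = 1.
    order = sorted(range(n), key=lambda i: speeds[i], reverse=True)

    c = 0.0  # common finishing time of the machines that are not capped
    k = 0    # number of machines capped at full load
    for k in range(L + 1):
        remaining = L - k
        if remaining == 0:
            c = 0.0  # capped machines already cover the whole load
            break
        rest = order[k:]
        c = remaining / sum(speeds[i] for i in rest)
        # Feasible if the fastest uncapped machine stays within its cap.
        if c * speeds[rest[0]] <= 1.0 + 1e-12:
            break

    loads = [0.0] * n
    for i in order[:k]:
        loads[i] = 1.0
    for i in order[k:]:
        loads[i] = min(1.0, c * speeds[i])

    finish_time = max(loads[i] / speeds[i] for i in range(n))
    return loads, finish_time


if __name__ == "__main__":
    # One fast machine and two slow ones, with threshold L = 2.
    print(optimal_loads([4.0, 1.0, 1.0], 2))
```

In this sketch the uncapped machines receive load proportional to their speed, so they all finish at the same time, while any machine fast enough to exceed a full load is capped at 1 and finishes earlier.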

research
01/12/2020

Heterogeneous Computation Assignments in Coded Elastic Computing

We study the optimal design of a heterogeneous coded elastic computing (...
research
07/18/2021

A Practical Algorithm Design and Evaluation for Heterogeneous Elastic Computing with Stragglers

Our extensive real measurements over Amazon EC2 show that the virtual in...
research
07/20/2021

A New Design Framework for Heterogeneous Uncoded Storage Elastic Computing

Elasticity is one important feature in modern cloud computing systems an...
research
02/27/2019

Coded Distributed Computing with Heterogeneous Function Assignments

Coded distributed computing (CDC) introduced by Li et al. is an effecti...
research
12/16/2018

Coded Elastic Computing

Cloud providers have recently introduced low-priority machines to reduce...
research
10/02/2019

Optimizing the Transition Waste in Coded Elastic Computing

Distributed computing, in which a resource-intensive task is divided int...
research
01/30/2023

Distributed Matrix Computations with Low-weight Encodings

Straggler nodes are well-known bottlenecks of distributed matrix computa...
