FLCD: A Flexible Low Complexity Design of Coded Distributed Computing

08/13/2020
by   Nicholas Woolsey, et al.
0

We propose a flexible low complexity design (FLCD) of coded distributed computing (CDC) with empirical evaluation on Amazon Elastic Compute Cloud (Amazon EC2). CDC can expedite MapReduce like computation by trading increased map computations to reduce communication load and shuffle time. A main novelty of FLCD is to utilize the design freedom in defining map and reduce functions to develop asymptotic homogeneous systems to support varying intermediate values (IV) sizes under a general MapReduce framework. Compared to existing designs with constant IV sizes, FLCD offers greater flexibility in adapting to network parameters and significantly reduces the implementation complexity by requiring fewer input files and shuffle groups. The FLCD scheme is the first proposed low-complexity CDC design that can operate on a network with an arbitrary number of nodes and computation load. We perform empirical evaluations of the FLCD by executing the TeraSort algorithm on an Amazon EC2 cluster. This is the first time that theoretical predictions of the CDC shuffle time are validated by empirical evaluations. The evaluations demonstrate a 2.0 to 4.24x speedup compared to conventional uncoded MapReduce, a 12 reduction in total time, and a wider range of operating network parameters compared to existing CDC schemes.

READ FULL TEXT
research
07/27/2023

Cascaded Code Distributed Computing With Low Complexity and Improved Flexibility

Coded distributed computing, proposed by Li et al., offers significant p...
research
08/02/2020

A Combinatorial Design for Cascaded Coded Distributed Computing on General Networks

Coding theoretic approached have been developed to significantly reduce ...
research
07/21/2020

A New Combinatorial Coded Design for Heterogeneous Distributed Computing

Coded Distributed Computing (CDC) introduced by Li et al. in 2015 offers...
research
02/12/2018

A New Combinatorial Design of Coded Distributed Computing

Coded distributed computing introduced by Li et al. in 2015 is an effici...
research
01/15/2020

Low Complexity Distributed Computing via Binary Matrices with Extension to Stragglers

We consider the distributed computing framework of Map-Reduce, which con...
research
02/12/2023

Asymptotically Optimal Cascaded Coded Distributed Computing via Combinatorial Designs

Coded distributed computing (CDC) introduced by Li et al. can greatly re...
research
02/08/2018

Leveraging Coding Techniques for Speeding up Distributed Computing

Large scale clusters leveraging distributed computing frameworks such as...

Please sign up or login with your details

Forgot password? Click here to reset