Universal Coded Distributed Computing For MapReduce Frameworks

01/17/2022
by   Yuhan Wang, et al.
0

Coded distributed computing (CDC) can trade extra computing power to reduce the communication load for the MapReduce-type systems. The optimal computation-communication tradeoff has been well studied for homogeneous systems, and some results have also been obtained under the heterogeneous condition in recent studies. However, the previous works allow the file placement and Reduce function assignment free to design for the scheme. In this paper, we consider the general heterogeneous MapReduce system, where the file placement and Reduce function assignment are arbitrary but prefixed among all nodes (i.e., can not be designed by the scheme), and the storage and the computational capabilities for different nodes are not necessarily equal. We propose two universal CDC schemes and establish upper bounds of the optimal communication load. The first achievable scheme, namely One-Shot Coded Transmission (OSCT), encodes the intermediate values (IVs) into message blocks with different sizes to exploit the multicasting gain, and each message block can be decoded independently by the intended nodes. The second scheme, namely Few-Shot Coded Transmission (FSCT), splits IVs into smaller pieces and each node jointly decodes multiple message blocks to obtain its desired IVs. We prove that our OSCT and FSCT are optimal in many cases, and give sufficient conditions for the optimality of OSCT and FSCT, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2019

Coded Distributed Computing with Heterogeneous Function Assignments

Coded distributed computing (CDC) introduced by Li et. al. is an effecti...
research
01/23/2019

Cascaded Coded Distributed Computing on Heterogeneous Networks

Coded distributed computing (CDC) introduced by Li et al. in 2015 offers...
research
08/19/2019

Heterogeneous Coded Distributed Computing: Joint Design of File Allocation and Function Assignment

This paper studies the computation-communication tradeoff in a heterogen...
research
01/23/2019

A Fundamental Storage-Communication Tradeoff in Distributed Computing with Straggling Nodes

The optimal storage-computation tradeoff is characterized for a MapReduc...
research
02/01/2018

Distributed Computing with Heterogeneous Communication Constraints: The Worst-Case Computation Load and Proof by Contradiction

We consider a distributed computing framework where the distributed node...
research
05/12/2022

Coded Data Rebalancing for Distributed Data Storage Systems with Cyclic Storage

We consider replication-based distributed storage systems in which each ...
research
07/09/2023

Sharper Asymptotically Optimal CDC Schemes via Combinatorial Designs

Coded distributed computing (CDC) was introduced to greatly reduce the c...

Please sign up or login with your details

Forgot password? Click here to reset