On the Fundamental Limits of Coded Data Shuffling for Distributed Learning Systems

07/11/2018
by Adel Elmahdy, et al.

We consider the data shuffling problem in a distributed learning system in which a master node, which has access to a database of files, is connected to a set of worker nodes via a shared link and uses it to communicate files to them. In every shuffling iteration, each worker node processes a new subset of files and uses its excess storage to cache part of the remaining files, under the assumption that cached files are stored uncoded. The caches of the worker nodes are updated in every iteration, and they must be designed to accommodate any possible, unknown permutation of the files in subsequent iterations. For this problem, we characterize the exact rate-memory trade-off for worst-case shuffling by deriving the minimum communication load for a given storage capacity per worker node. As a byproduct, we characterize the exact rate-memory trade-off for any shuffling when the number of files equals the number of worker nodes. We propose a novel deterministic coded shuffling scheme that improves on the state of the art by exploiting the cache memories to create coded functions that can be decoded by several worker nodes. We then prove the optimality of the proposed scheme by deriving a matching lower bound and by showing that the placement phase of the proposed scheme is optimal over all shuffles.
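The idea of a coded function that several worker nodes can decode can be illustrated with a toy case. The sketch below is not the scheme proposed in the paper. It assumes two workers and two equal-length files, a shuffle that swaps the files between the workers, and plain XOR as the coding operation. The names `xor_bytes`, `file_a`, and `file_b` are hypothetical.

```python
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bytewise XOR of two equal-length byte strings."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return bytes(x ^ y for x, y in zip(a, b))


# Hypothetical files held by the master (equal length by assumption).
file_a = b"file-A-contents!"
file_b = b"file-B-contents?"

# Current iteration: worker 1 stores file_a, worker 2 stores file_b.
# Next iteration (assumed shuffle): worker 1 needs file_b, worker 2 needs file_a.

# Uncoded delivery sends both files over the shared link.
uncoded_load = len(file_a) + len(file_b)

# Coded delivery broadcasts a single XOR of the two files.
coded = xor_bytes(file_a, file_b)
coded_load = len(coded)

# Each worker cancels the file it already stores to recover the one it needs.
decoded_at_worker_1 = xor_bytes(coded, file_a)
decoded_at_worker_2 = xor_bytes(coded, file_b)

print(decoded_at_worker_1 == file_b, decoded_at_worker_2 == file_a)
print(uncoded_load, coded_load)
```

In this toy case one broadcast serves both workers, so the load on the shared link is one file instead of two. The general trade-off between storage per worker and communication load is what the paper characterizes.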

research
10/21/2020

Coded Computing for Master-Aided Distributed Computing Systems

We consider a MapReduce-type task running in a distributed computing mod...
research
06/29/2018

Fundamental Limits of Distributed Data Shuffling

Data shuffling of training data among different computing nodes (workers...
research
05/21/2019

Fundamental Limits of Coded Caching: The Memory Rate Pair (K-1-1/K, 1/(K-1))

Maddah-Ali and Niesen, in a seminal paper, introduced the notion of code...
research
01/24/2021

The Exact Rate Memory Tradeoff for Large Caches with Coded Placement

The idea of coded caching for content distribution networks was introduc...
research
01/17/2019

On Coded Caching with Correlated Files

This paper studies the fundamental limits of the shared-link caching pro...
research
01/05/2018

Near Optimal Coded Data Shuffling for Distributed Learning

Data shuffling between distributed cluster of nodes is one of the critic...
research
01/14/2020

Coded Data Rebalancing: Fundamental Limits and Constructions

Distributed databases often suffer unequal distribution of data among st...
