Coded Matrix Multiplication on a Group-Based Model

01/16/2019
by   Muah Kim, et al.
0

Coded distributed computing has been considered as a promising technique which makes large-scale systems robust to the "straggler" workers. Yet, practical system models for distributed computing have not been available that reflect the clustered or grouped structure of real-world computing servers. Neither the large variations in the computing power and bandwidth capabilities across different servers have been properly modeled. We suggest a group-based model to reflect practical conditions and develop an appropriate coding scheme for this model. The suggested code, called group code, employs parallel encoding for each group. We show that the suggested coding scheme can asymptotically achieve optimal computing time in regimes of infinite n, the number of workers. While theoretical analysis is conducted in the asymptotic regime, numerical results also show that the suggested scheme achieves near-optimal computing time for any finite but reasonably large n. Moreover, we demonstrate that the decoding complexity of the suggested scheme is significantly reduced by the virtue of parallel decoding.

READ FULL TEXT
research
01/15/2018

Hierarchical Coding for Distributed Computing

Coding for distributed computing supports low-latency computation by rel...
research
02/09/2018

Coded Sparse Matrix Multiplication

In a large-scale and distributed matrix multiplication problem C=A^B, wh...
research
09/22/2021

Numerically Stable Binary Coded Computations

This paper addresses the gradient coding and coded matrix multiplication...
research
04/20/2019

Optimal Load Allocation for Coded Distributed Computation in Heterogeneous Clusters

Recently, coding has been a useful technique to mitigate the effect of s...
research
01/20/2020

Bivariate Polynomial Coding for Exploiting Stragglers in Heterogeneous Coded Computing Systems

Polynomial coding has been proposed as a solution to the straggler mitig...
research
05/11/2023

Probabilistic Group Testing in Distributed Computing with Attacked Workers

The problem of distributed matrix-vector product is considered, where th...
research
07/20/2019

Hierarchical Coded Matrix Multiplication

Slow working nodes, known as stragglers, can greatly reduce the speed of...

Please sign up or login with your details

Forgot password? Click here to reset