Erasure coding for distributed matrix multiplication for matrices with bounded entries

11/06/2018
by   Li Tang, et al.
0

Distributed matrix multiplication is widely used in several scientific domains. It is well recognized that computation times on distributed clusters are often dominated by the slowest workers (called stragglers). Recent work has demonstrated that straggler mitigation can be viewed as a problem of designing erasure codes. For matrices A and B, the technique essentially maps the computation of A^T B into the multiplication of smaller (coded) submatrices. The stragglers are treated as erasures in this process. The computation can be completed as long as a certain number of workers (called the recovery threshold) complete their assigned tasks. We present a novel coding strategy for this problem when the absolute values of the matrix entries are sufficiently small. We demonstrate a tradeoff between the assumed absolute value bounds on the matrix entries and the recovery threshold. At one extreme, we are optimal with respect to the recovery threshold and on the other extreme, we match the threshold of prior work. Experimental results on cloud-based clusters validate the benefits of our method.

READ FULL TEXT
research
01/31/2018

On the Optimal Recovery Threshold of Coded Matrix Multiplication

We provide novel coded computation strategies for distributed matrix-mat...
research
09/19/2023

Distributed Matrix Multiplication with a Smaller Recovery Threshold through Modulo-based Approaches

This paper considers the problem of calculating the matrix multiplicatio...
research
04/27/2020

Rateless Codes for Private Distributed Matrix-Matrix Multiplication

We consider the problem of designing rateless coded private distributed ...
research
01/03/2022

A Systematic Approach towards Efficient Private Matrix Multiplication

We consider the problems of Private and Secure Matrix Multiplication (PS...
research
12/26/2018

CodedSketch: A Coding Scheme for Distributed Computation of Approximated Matrix Multiplication

In this paper, we propose CodedSketch, as a distributed straggler-resist...
research
03/04/2021

Straggler Mitigation through Unequal Error Protection for Distributed Approximate Matrix Multiplication

Large-scale machine learning and data mining methods routinely distribut...
research
12/26/2018

CodedSketch: A Coding Scheme for Distributed Computation of Approximated Matrix Multiplications

In this paper, we propose CodedSketch, as a distributed straggler-resist...

Please sign up or login with your details

Forgot password? Click here to reset