Variable Coded Batch Matrix Multiplication

05/13/2021
by   Lev Tauz, et al.

In this paper, we introduce the Variable Coded Distributed Batch Matrix Multiplication (VCDBMM) problem, which tasks a distributed system with performing batch matrix multiplication where matrices are not necessarily distinct among batch jobs. Most coded matrix-matrix computation work has broadly focused on two directions: matrix partitioning for computing a single computation task and batch processing of multiple distinct computation tasks. While these works provide codes with good straggler resilience and fast decoding for their problem spaces, these codes cannot take advantage of the natural redundancy of re-using matrices across batch jobs. Inspired by Cross-Subspace Alignment codes, we develop Flexible Cross-Subspace Alignment (FCSA) codes that are flexible enough to utilize this redundancy. We provide a full characterization of FCSA codes, which allow for a wide variety of system complexities, including good straggler resilience and fast decoding. We theoretically demonstrate that, under certain practical conditions, FCSA codes are within a factor of two of the optimal solution when it comes to straggler resilience; our simulations demonstrate that our codes achieve even better optimality gaps in practice.
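
As a rough illustration of what straggler resilience means in coded matrix multiplication, the following sketch uses a simple (3, 2) parity code over row blocks. It is not the FCSA construction from the paper and does not exploit matrix re-use across batch jobs; the helper names (`matmul`, `encode`, `decode`) and the toy matrices are assumptions made only for this example. Any two of the three workers are enough to recover the product, so one straggler can be ignored.

```python
# Illustrative (3, 2) parity-coded matrix multiplication.
# NOT the FCSA code from the paper; all helper names are hypothetical.

def matmul(X, Y):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def add(X, Y):
    """Entry-wise sum of two equally sized matrices."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def sub(X, Y):
    """Entry-wise difference of two equally sized matrices."""
    return [[x - y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def encode(A):
    """Split A (even number of rows) into two row blocks plus a parity block."""
    half = len(A) // 2
    A1, A2 = A[:half], A[half:]
    return {0: A1, 1: A2, 2: add(A1, A2)}

def decode(results):
    """Recover A @ B from the results of any two of the three workers."""
    if 0 in results and 1 in results:
        top, bottom = results[0], results[1]
    elif 0 in results:
        # Workers 0 and 2 responded: A2 @ B = (A1 + A2) @ B - A1 @ B
        top, bottom = results[0], sub(results[2], results[0])
    else:
        # Workers 1 and 2 responded: A1 @ B = (A1 + A2) @ B - A2 @ B
        top, bottom = sub(results[2], results[1]), results[1]
    return top + bottom  # stack the two row blocks

A = [[1, 2], [3, 4], [5, 6], [7, 8]]
B = [[1, 0], [2, 1]]

# Each worker multiplies its coded share of A by B.
worker_out = {w: matmul(share, B) for w, share in encode(A).items()}

# Drop each worker in turn to simulate a straggler; decoding still succeeds.
for straggler in range(3):
    survivors = {w: r for w, r in worker_out.items() if w != straggler}
    assert decode(survivors) == matmul(A, B)
```

The parity block costs one extra worker but lets the system finish as soon as the fastest two workers respond; the codes discussed in the abstract target the same kind of trade-off for batches of products.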


