Hierarchical Coded Computation

06/26/2018
by   Nuwan Ferdinand, et al.
0

Coded computation is a method to mitigate "stragglers" in distributed computing systems through the use of error correction coding that has lately received significant attention. First used in vector-matrix multiplication, the range of application was later extended to include matrix-matrix multiplication, heterogeneous networks, convolution, and approximate computing. A drawback to previous results is they completely ignore work completed by stragglers. While stragglers are slower compute nodes, in many settings the amount of work completed by stragglers can be non-negligible. Thus, in this work, we propose a hierarchical coded computation method that exploits the work completed by all compute nodes. We partition each node's computation into layers of sub-computations such that each layer can be treated as (distinct) erasure channel. We then design different erasure codes for each layer so that all layers have the same failure exponent. We propose design guidelines to optimize parameters of such codes. Numerical results show the proposed scheme has an improvement of a factor of 1.5 in the expected finishing time compared to previous work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2019

Hierarchical Coded Matrix Multiplication

Slow working nodes, known as stragglers, can greatly reduce the speed of...
research
06/26/2018

Exploitation of Stragglers in Coded Computation

In cloud computing systems slow processing nodes, often referred to as "...
research
04/25/2019

Array BP-XOR Codes for Parallel Matrix Multiplication using Hierarchical Computing

This study presents a novel coded computation technique for parallel mat...
research
01/21/2019

Polar Coded Distributed Matrix Multiplication

We propose a polar coding mechanism for distributed matrix multiplicatio...
research
07/20/2019

Cuboid Partitioning for Hierarchical Coded Matrix Multiplication

Coded matrix multiplication is a technique to enable straggler-resistant...
research
09/01/2023

Randomized Polar Codes for Anytime Distributed Machine Learning

We present a novel distributed computing framework that is robust to slo...
research
03/07/2021

Adaptive Coding for Matrix Multiplication at Edge Networks

Edge computing is emerging as a new paradigm to allow processing data at...

Please sign up or login with your details

Forgot password? Click here to reset