On Batch-Processing Based Coded Computing for Heterogeneous Distributed Computing Systems

12/29/2019
by   Baoqian Wang, et al.

In recent years, coded distributed computing (CDC) has attracted significant attention because it protects delay-sensitive computation tasks against unexpected latencies in distributed computing systems. Despite this salient feature, many design challenges and opportunities remain. In this paper, we focus on practical computing systems with heterogeneous computing resources and propose a novel CDC approach, called batch-processing based coded computing (BPCC), which exploits the fact that every computing node can obtain some coded results before it completes its whole task. To this end, we first describe the main idea of the BPCC framework and then formulate an optimization problem for BPCC that minimizes the task completion time by configuring the computation load and the number of batches assigned to each computing node. Depending on whether the batch-induced overhead can be neglected, we develop two BPCC schemes, BPCC-1 and BPCC-2, for negligible and linear batching overheads, respectively. Through theoretical analysis, simulation studies, and real experiments on two heterogeneous distributed computing systems, (1) an Amazon EC2 computing cluster and (2) an unmanned aerial vehicle (UAV)-based airborne computing platform, we demonstrate the high computational and energy efficiency of the proposed BPCC schemes.
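To illustrate why returning results batch by batch can shorten the completion time, here is a minimal timing sketch. It is not the authors' BPCC-1 or BPCC-2 algorithm: the function name `completion_time`, the load allocation, and the processing rates are hypothetical. It assumes deterministic node speeds, zero batching overhead, and a code in which any `rows_needed` coded rows are enough for the master to decode.

```python
def completion_time(loads, batches, rates, rows_needed):
    """Return the time at which the master holds `rows_needed` coded rows.

    loads[i]   : coded rows assigned to node i (hypothetical allocation)
    batches[i] : number of near-equal batches node i splits its load into
    rates[i]   : rows per second node i processes (assumed deterministic)
    Assumes no per-batch overhead and that any `rows_needed` rows suffice.
    """
    events = []  # (time a batch is returned, rows in that batch)
    for load, p, rate in zip(loads, batches, rates):
        base, extra = divmod(load, p)
        done = 0
        for k in range(p):
            size = base + (1 if k < extra else 0)
            done += size
            # A node returns each batch as soon as it has processed it.
            events.append((done / rate, size))

    events.sort()
    received = 0
    for time, size in events:
        received += size
        if received >= rows_needed:
            return time
    return float("inf")  # the assigned loads cannot cover the requirement


# Two nodes, the second half as fast; the master needs 90 of 120 coded rows.
no_batching = completion_time([60, 60], [1, 1], [10, 5], 90)    # 12.0
with_batching = completion_time([60, 60], [6, 6], [10, 5], 90)  # 6.0
print(no_batching, with_batching)
```

Without batching, the master must wait for the slow node to finish all 60 rows. With six batches per node, the 30 rows the slow node has already produced by the time the fast node finishes are usable, so the task completes in half the time in this toy setting.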

research
04/16/2019

Heterogeneous Coded Computation across Heterogeneous Workers

Coded distributed computing framework enables large-scale machine learni...
research
02/27/2019

Coded Distributed Computing with Heterogeneous Function Assignments

Coded distributed computing (CDC) introduced by Li et al. is an effecti...
research
07/13/2020

Bottom-up mechanism and improved contract net protocol for the dynamic task planning of heterogeneous Earth observation resources

Earth observation resources are becoming increasingly indispensable in d...
research
01/05/2022

Dynamic Coded Distributed Convolution for UAV-based Networked Airborne Computing

A single unmanned aerial vehicle (UAV) has limited computing resources a...
research
04/16/2019

Heterogeneous Computation across Heterogeneous Workers

Coded distributed computing framework enables large-scale machine learni...
research
04/15/2021

Multi-Agent Reinforcement Learning Based Coded Computation for Mobile Ad Hoc Computing

Mobile ad hoc computing (MAHC), which allows mobile devices to directly ...
research
07/18/2021

A Practical Algorithm Design and Evaluation for Heterogeneous Elastic Computing with Stragglers

Our extensive real measurements over Amazon EC2 show that the virtual in...
