Incentive Mechanism Design for Distributed Coded Machine Learning

12/16/2020
by   Ningning Ding, et al.
0

A distributed machine learning platform needs to recruit many heterogeneous worker nodes to finish computation simultaneously. As a result, the overall performance may be degraded due to straggling workers. By introducing redundancy into computation, coded machine learning can effectively improve the runtime performance by recovering the final computation result through the first k (out of the total n) workers who finish computation. While existing studies focus on designing efficient coding schemes, the issue of designing proper incentives to encourage worker participation is still under-explored. This paper studies the platform's optimal incentive mechanism for motivating proper workers' participation in coded machine learning, despite the incomplete information about heterogeneous workers' computation performances and costs. A key contribution of this work is to summarize workers' multi-dimensional heterogeneity as a one-dimensional metric, which guides the platform's efficient selection of workers under incomplete information with a linear computation complexity. Moreover, we prove that the optimal recovery threshold k is linearly proportional to the participator number n if we use the widely adopted MDS (Maximum Distance Separable) codes for data encoding. We also show that the platform's increased cost due to incomplete information disappears when worker number is sufficiently large, but it does not monotonically decrease in worker number.

READ FULL TEXT
research
07/04/2020

Coded Distributed Computing with Partial Recovery

Coded computation techniques provide robustness against straggling worke...
research
01/31/2018

On the Optimal Recovery Threshold of Coded Matrix Multiplication

We provide novel coded computation strategies for distributed matrix-mat...
research
05/11/2023

Efficient Coded Multi-Party Computation at Edge Networks

Multi-party computation (MPC) is promising for designing privacy-preserv...
research
11/22/2017

Combating Computational Heterogeneity in Large-Scale Distributed Computing via Work Exchange

Owing to data-intensive large-scale applications, distributed computatio...
research
01/23/2020

Coded Computing for Boolean Functions

The growing size of modern datasets necessitates a massive computation i...
research
11/20/2021

Reliable Coded Distributed Computing for Metaverse Services: Coalition Formation and Incentive Mechanism Design

The metaverse is regarded as a new wave of technological transformation ...
research
06/06/2022

Optimization-based Block Coordinate Gradient Coding for Mitigating Partial Stragglers in Distributed Learning

Gradient coding schemes effectively mitigate full stragglers in distribu...

Please sign up or login with your details

Forgot password? Click here to reset