Byzantine Fault Tolerance in Distributed Machine Learning : a Survey

05/05/2022
by   Djamila Bouhata, et al.
0

Byzantine Fault Tolerance (BFT) is among the most challenging problems in Distributed Machine Learning (DML). Byzantine failures are still difficult to tackle due to their unrestricted nature; as a result, the possibility of generating arbitrary data. Extensive research efforts are persistently being made to put into effect the BFT in DML. Some recent studies have been undertaken to take into consideration various BFT approaches in DML. However, certain aspects seem to be limited, such as the few analyzed approaches and the absence of the techniques classification employed in the studied approaches. In this paper, we present a survey of recent works surrounding BFT in DML. Mainly in first-order optimization methods, especially Stochastic Gradient Descent (SGD). We highlight the key techniques as well as fundamental approaches. We offer an illustrative description of techniques used in BFT in DML, with a proposed classification of BFTs approaches in the context of their basic techniques. This classification is established on specific criteria such as communication process, optimization method, and topology setting, which characterize future work methods examining the current challenges of BFT in DML

READ FULL TEXT

page 2

page 8

research
08/26/2021

Byzantine Fault-Tolerance in Federated Local SGD under 2f-Redundancy

We consider the problem of Byzantine fault-tolerance in federated machin...
research
06/16/2021

A Survey on Fault-tolerance in Distributed Optimization and Machine Learning

The robustness of distributed optimization is an emerging field of study...
research
03/08/2017

Byzantine-Tolerant Machine Learning

The growth of data, the need for scalability and the complexity of model...
research
08/28/2017

ByRDiE: Byzantine-resilient distributed coordinate descent for decentralized learning

Distributed machine learning algorithms enable processing of datasets th...
research
01/28/2021

Byzantine Fault-Tolerance in Peer-to-Peer Distributed Gradient-Descent

We consider the problem of Byzantine fault-tolerance in the peer-to-peer...
research
12/04/2017

Revisiting Fast Practical Byzantine Fault Tolerance

In this note, we observe a safety violation in Zyzzyva and a liveness vi...

Please sign up or login with your details

Forgot password? Click here to reset