ByRDiE: Byzantine-resilient distributed coordinate descent for decentralized learning

08/28/2017
by Zhixiong Yang, et al.

Distributed machine learning algorithms enable processing of datasets that are distributed over a network without gathering the data at a centralized location. While efficient distributed algorithms have been developed under the assumption of faultless networks, real-world networks do experience failures that can render these algorithms nonfunctional. This paper focuses on Byzantine failures, which are the hardest failures to safeguard against in distributed algorithms. Although Byzantine fault tolerance has a rich history, existing work does not translate into efficient and practical algorithms for high-dimensional distributed learning tasks. This paper develops and analyzes two variants of an algorithm, termed Byzantine-resilient distributed coordinate descent (ByRDiE), that solve distributed learning problems in the presence of Byzantine failures. Theoretical analysis and numerical experiments presented in the paper highlight the usefulness of ByRDiE for high-dimensional distributed learning in the presence of Byzantine failures.
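
The abstract does not spell out the update rule, so the following is only an illustrative sketch of how a single-coordinate update might be made robust to outlying values. It assumes a trimmed-mean screening step, a common technique in Byzantine-resilient learning that is not confirmed by this abstract. The names `trimmed_mean`, `screened_coordinate_step`, `b`, and `step` are hypothetical.

```python
from typing import Sequence


def trimmed_mean(values: Sequence[float], b: int) -> float:
    """Average the values after dropping the b smallest and b largest.

    Assumption: at most b of the values come from faulty nodes.
    """
    if b < 0 or len(values) <= 2 * b:
        raise ValueError("need more than 2*b values to trim b from each end")
    kept = sorted(values)[b:len(values) - b]
    return sum(kept) / len(kept)


def screened_coordinate_step(
    own_value: float,
    neighbor_values: Sequence[float],
    grad_k: float,
    step: float,
    b: int,
) -> float:
    """One scalar update of a single coordinate at one node (hypothetical rule).

    The node pools its own value with the values reported by its neighbors,
    screens the pool with a trimmed mean, and then takes a gradient step
    along that coordinate using its local partial derivative grad_k.
    """
    pooled = list(neighbor_values) + [own_value]
    return trimmed_mean(pooled, b) - step * grad_k
```

Because each update touches one scalar coordinate at a time, the screening step only needs to sort scalars, which is one plausible reason a coordinate-wise scheme can stay practical when the model dimension is large.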

research
08/21/2019

BRIDGE: Byzantine-resilient Decentralized Gradient Descent

Decentralized optimization techniques are increasingly being used to lea...
research
07/25/2023

High Dimensional Distributed Gradient Descent with Arbitrary Number of Byzantine Attackers

Robust distributed learning with Byzantine failures has attracted extens...
research
04/20/2023

Byzantine-Resilient Learning Beyond Gradients: Distributing Evolutionary Search

Modern machine learning (ML) models are capable of impressive performanc...
research
02/04/2022

SignSGD: Fault-Tolerance to Blind and Byzantine Adversaries

Distributed learning has become a necessity for training ever-growing mo...
research
08/23/2019

Adversary-resilient Inference and Machine Learning: From Distributed to Decentralized

While the last few decades have witnessed a huge body of work devoted to...
research
05/05/2022

Byzantine Fault Tolerance in Distributed Machine Learning : a Survey

Byzantine Fault Tolerance (BFT) is among the most challenging problems i...
research
07/04/2022

On implementing SWMR registers from SWSR registers in systems with Byzantine failures

The implementation of registers from (potentially) weaker registers is a...