ELFISH: Resource-Aware Federated Learning on Heterogeneous Edge Devices

12/03/2019
by   Zirui Xu, et al.
0

In this work, we propose ELFISH - a resource-aware federated learning framework to tackle computation stragglers in federated learning. In ELFISH, neural network models' training consumption will be firstly profiled in terms of different computation resources. Guided by profiling, a "soft-training" method is proposed for straggler acceleration, which partially trains the model by masking a particular number of resource-intensive neurons. Rather than generating a deterministically optimized model with diverged structure, different sets of neurons will be dynamically masked every training cycle and will be recovered and updated during parameter aggregation, ensuring comprehensive model updates overtime. The corresponding parameter aggregation scheme is also proposed to balance the contribution from soft-trained models and guarantee the collaborative convergence. Eventually, ELFISH overcomes the computational heterogeneity of edge devices and achieves synchronized collaboration without computational stragglers. Experiments show that ELFISH can provide up to 2x training acceleration with soft-training in various straggler settings. Furthermore, benefited from the proposed parameter aggregation scheme, ELFISH improves the model accuracy for 4 collaborative convergence robustness.

READ FULL TEXT
research
07/12/2023

Tackling Computational Heterogeneity in FL: A Few Theoretical Insights

The future of machine learning lies in moving data collection along with...
research
10/26/2022

Hierarchical Federated Learning with Momentum Acceleration in Multi-Tier Networks

In this paper, we propose Hierarchical Federated Learning with Momentum ...
research
11/09/2022

Resource-Aware Heterogeneous Federated Learning using Neural Architecture Search

Federated Learning (FL) is extensively used to train AI/ML models in dis...
research
06/07/2023

Resource Aware Clustering for Tackling the Heterogeneity of Participants in Federated Learning

Federated Learning is a training framework that enables multiple partici...
research
08/27/2021

Canoe : A System for Collaborative Learning for Neural Nets

For highly distributed environments such as edge computing, collaborativ...
research
03/23/2022

Contextual Model Aggregation for Fast and Robust Federated Learning in Edge Computing

Federated learning is a prime candidate for distributed machine learning...
research
05/07/2018

Soft Maximin Aggregation of Heterogeneous Array Data

The extraction of a common signal across many recordings is difficult wh...

Please sign up or login with your details

Forgot password? Click here to reset