Just-in-Time Aggregation for Federated Learning

08/20/2022
by   K. R. Jayaram, et al.

The increasing number and scale of federated learning (FL) jobs necessitate resource-efficient scheduling and management of aggregation to make the economics of cloud-hosted aggregation work. Existing FL research has focused on the design of FL algorithms and optimization, and less on the efficacy of aggregation. Existing FL platforms often employ aggregators that actively wait for model updates. This wastes computational resources in the cloud, especially in large-scale FL settings where parties are only intermittently available for training. In this paper, we propose a new FL aggregation paradigm, "just-in-time" (JIT) aggregation, that leverages unique properties of FL jobs, especially the periodicity of model updates, to defer aggregation as much as possible and free compute resources for other FL jobs or other datacenter workloads. We describe a novel way to prioritize FL jobs for aggregation, and demonstrate, using multiple datasets, models, and FL aggregation algorithms, that our techniques can reduce resource usage by 60+% compared to the eager aggregation used in existing FL platforms. We also demonstrate that JIT aggregation has negligible overhead and negligible impact on the latency of the FL job.
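To make the idea of deferring aggregation concrete, the following is a minimal sketch and not the authors' implementation. It assumes a deliberately simplified model in which an eager aggregator is held from the start of a round until aggregation completes, while a deferred aggregator is started only when the last model update is expected. The names `FLRound`, `eager_seconds`, `deferred_seconds`, `fraction_saved`, and `start_order`, as well as all timing values, are hypothetical and chosen only for illustration; ordering jobs by deferred start time is a stand-in, since the abstract does not describe the paper's prioritization scheme.

```python
import heapq
from dataclasses import dataclass


@dataclass
class FLRound:
    job: str
    round_start: float           # seconds; when parties begin local training
    expected_last_update: float  # predicted arrival of the final model update
    aggregation_time: float      # estimated time to fuse all updates


def eager_seconds(r: FLRound) -> float:
    """Compute time held when the aggregator waits from the round start."""
    return (r.expected_last_update - r.round_start) + r.aggregation_time


def deferred_seconds(r: FLRound) -> float:
    """Compute time held when the aggregator starts at the last expected update."""
    return r.aggregation_time


def fraction_saved(r: FLRound) -> float:
    """Fraction of aggregator time freed by deferring (simplified model)."""
    return 1.0 - deferred_seconds(r) / eager_seconds(r)


def start_order(rounds):
    """Yield job names by deferred start time, earliest first (an assumption)."""
    heap = [(r.expected_last_update, r.job) for r in rounds]
    heapq.heapify(heap)
    while heap:
        yield heapq.heappop(heap)[1]


# Hypothetical rounds; the numbers are illustrative, not measurements.
rounds = [
    FLRound("job-a", round_start=0.0, expected_last_update=600.0, aggregation_time=60.0),
    FLRound("job-b", round_start=0.0, expected_last_update=300.0, aggregation_time=30.0),
]

for r in rounds:
    print(r.job, eager_seconds(r), deferred_seconds(r), round(fraction_saved(r), 3))
print(list(start_order(rounds)))
```

In this toy model the saving grows with the gap between the round start and the last expected update, which is why predictable, periodic updates matter: the later the aggregator can safely be started, the longer its resources remain available to other workloads.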

