Knowledge Distillation for Federated Learning: a Practical Guide

11/09/2022
by   Alessio Mora, et al.
0

Federated Learning (FL) enables the training of Deep Learning models without centrally collecting possibly sensitive raw data. This paves the way for stronger privacy guarantees when building predictive models. The most used algorithms for FL are parameter-averaging based schemes (e.g., Federated Averaging) that, however, have well known limits: (i) Clients must implement the same model architecture; (ii) Transmitting model weights and model updates implies high communication cost, which scales up with the number of model parameters; (iii) In presence of non-IID data distributions, parameter-averaging aggregation schemes perform poorly due to client model drifts. Federated adaptations of regular Knowledge Distillation (KD) can solve and/or mitigate the weaknesses of parameter-averaging FL algorithms while possibly introducing other trade-offs. In this article, we provide a review of KD-based algorithms tailored for specific FL issues.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2020

Ensemble Distillation for Robust Model Fusion in Federated Learning

Federated Learning (FL) is a machine learning setting where many devices...
research
01/28/2022

Gradient Masked Averaging for Federated Learning

Federated learning is an emerging paradigm that permits a large number o...
research
04/01/2021

Decentralized and Model-Free Federated Learning: Consensus-Based Distillation in Function Space

This paper proposes a decentralized FL scheme for IoE devices connected ...
research
11/04/2020

Federated Knowledge Distillation

Distributed learning frameworks often rely on exchanging model parameter...
research
05/21/2023

One-Shot Federated Learning for LEO Constellations that Reduces Convergence Time from Days to 90 Minutes

A Low Earth orbit (LEO) satellite constellation consists of a large numb...
research
03/28/2022

Federated Learning with Position-Aware Neurons

Federated Learning (FL) fuses collaborative models from local nodes with...
research
02/12/2020

Federated Clustering via Matrix Factorization Models: From Model Averaging to Gradient Sharing

Recently, federated learning (FL) has drawn significant attention due to...

Please sign up or login with your details

Forgot password? Click here to reset