Jointly Learning from Decentralized (Federated) and Centralized Data to Mitigate Distribution Shift

11/23/2021
by Sean Augenstein, et al.

Motivated by privacy, Federated Learning (FL) is an increasingly used paradigm in which learning takes place collectively on edge devices, each holding a cache of user-generated training examples that never leave the local device. These on-device training examples are gathered in situ as users interact with their devices, and thus closely reflect at least part of the inference data distribution. Yet a distribution shift may still exist: the on-device training examples may lack some data inputs that are expected at inference time. This paper proposes a way to mitigate this shift: selective usage of datacenter data, mixed in with FL. By mixing decentralized (federated) and centralized (datacenter) data, we can form an effective training data distribution that better matches the inference data distribution, resulting in more useful models while still meeting the private training data access constraints imposed by FL.
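To make the idea of an "effective training data distribution" concrete, the toy sketch below treats it as a weighted mixture of a federated and a datacenter distribution over input categories, and compares each against the inference distribution using total variation distance. This is an illustration only and is not taken from the paper: the category names, all probabilities, the mixing weight `w`, and the helper names `mix` and `total_variation` are hypothetical.

```python
def mix(p_fed, p_dc, w):
    """Return the mixture w * p_fed + (1 - w) * p_dc over the union of categories."""
    keys = set(p_fed) | set(p_dc)
    return {k: w * p_fed.get(k, 0.0) + (1 - w) * p_dc.get(k, 0.0) for k in keys}


def total_variation(p, q):
    """Total variation distance between two discrete distributions."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


# Hypothetical toy numbers: on-device data never contains category "c",
# while the datacenter data covers only "c".
inference = {"a": 0.5, "b": 0.3, "c": 0.2}
federated = {"a": 0.625, "b": 0.375, "c": 0.0}
datacenter = {"a": 0.0, "b": 0.0, "c": 1.0}

# Assumed mixing weight: 80% federated examples, 20% datacenter examples.
mixed = mix(federated, datacenter, w=0.8)

shift_federated_only = total_variation(federated, inference)
shift_mixed = total_variation(mixed, inference)
```

In this toy setup the mixed distribution sits closer to the inference distribution than the federated-only one, because the datacenter data supplies the category that the on-device caches lack. How the mixing is actually carried out during training is described in the full paper, not here.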


research
05/26/2022

Mixed Federated Learning: Joint Decentralized and Centralized Learning

Federated learning (FL) enables learning from decentralized privacy-sens...
research
08/22/2021

Flexible Clustered Federated Learning for Client-Level Data Distribution Shift

Federated Learning (FL) enables the multiple participating devices to co...
research
07/16/2021

AutoFL: Enabling Heterogeneity-Aware Energy Efficient Federated Learning

Federated learning enables a cluster of decentralized mobile devices at ...
research
03/13/2023

Cross-device Federated Learning for Mobile Health Diagnostics: A First Study on COVID-19 Detection

Federated learning (FL) aided health diagnostic models can incorporate d...
research
02/08/2022

Learnings from Federated Learning in the Real world

Federated Learning (FL) applied to real world data may suffer from sever...
research
04/17/2022

WhyGen: Explaining ML-powered Code Generation by Referring to Training Examples

Deep learning has demonstrated great abilities in various code generatio...
research
05/11/2021

Federated Unbiased Learning to Rank

Unbiased Learning to Rank (ULTR) studies the problem of learning a ranki...