Understanding Unintended Memorization in Federated Learning

06/12/2020
by   Om Thakkar, et al.
0

Recent works have shown that generative sequence models (e.g., language models) have a tendency to memorize rare or unique sequences in the training data. Since useful models are often trained on sensitive data, to ensure the privacy of the training data it is critical to identify and mitigate such unintended memorization. Federated Learning (FL) has emerged as a novel framework for large-scale distributed learning tasks. However, it differs in many aspects from the well-studied central learning setting where all the data is stored at the central server. In this paper, we initiate a formal study to understand the effect of different components of canonical FL on unintended memorization in trained models, comparing with the central learning setting. Our results show that several differing components of FL play an important role in reducing unintended memorization. Specifically, we observe that the clustering of data according to users—which happens by design in FL—has a significant effect in reducing such memorization, and using the method of Federated Averaging for training causes a further reduction. We also show that training with a strong user-level differential privacy guarantee results in models that exhibit the least amount of unintended memorization.

READ FULL TEXT
research
07/22/2023

Security and Privacy Issues of Federated Learning

Federated Learning (FL) has emerged as a promising approach to address d...
research
05/06/2022

Federated Learning with Noisy User Feedback

Machine Learning (ML) systems are getting increasingly popular, and driv...
research
09/20/2020

When Federated Learning Meets Blockchain: A New Distributed Learning Paradigm

Motivated by the advancing computational capabilities of wireless end us...
research
10/27/2019

Federated Uncertainty-Aware Learning for Distributed Hospital EHR Data

Recent works have shown that applying Machine Learning to Electronic Hea...
research
05/15/2023

FLARE: Detection and Mitigation of Concept Drift for Federated Learning based IoT Deployments

Intelligent, large-scale IoT ecosystems have become possible due to rece...
research
07/14/2021

Federated Mixture of Experts

Federated learning (FL) has emerged as the predominant approach for coll...
research
01/26/2023

SuperFed: Weight Shared Federated Learning

Federated Learning (FL) is a well-established technique for privacy pres...

Please sign up or login with your details

Forgot password? Click here to reset