Generative Models for Effective ML on Private, Decentralized Datasets

11/15/2019
by   Sean Augenstein, et al.
29

To improve real-world applications of machine learning, experienced modelers develop intuition about their datasets, their models, and how the two interact. Manual inspection of raw data - of representative samples, of outliers, of misclassifications - is an essential tool in a) identifying and fixing problems in the data, b) generating new modeling hypotheses, and c) assigning or refining human-provided labels. However, manual data inspection is problematic for privacy sensitive datasets, such as those representing the behavior of real-world individuals. Furthermore, manual data inspection is impossible in the increasingly important setting of federated learning, where raw examples are stored at the edge and the modeler may only access aggregated outputs such as metrics or model parameters. This paper demonstrates that generative models - trained using federated methods and with formal differential privacy guarantees - can be used effectively to debug many commonly occurring data issues even when the data cannot be directly inspected. We explore these methods in applications to text with differentially private federated RNNs and to images using a novel algorithm for differentially private federated GANs.

READ FULL TEXT

page 9

page 24

page 25

page 26

research
11/24/2019

Differentially Private Federated Variational Inference

In many real-world applications of machine learning, data are distribute...
research
01/08/2021

DiPSeN: Differentially Private Self-normalizing Neural Networks For Adversarial Robustness in Federated Learning

The need for robust, secure and private machine learning is an important...
research
09/12/2019

Differentially Private Meta-Learning

Parameter-transfer is a well-known and versatile approach for meta-learn...
research
04/10/2020

Decentralized Differentially Private Segmentation with PATE

When it comes to preserving privacy in medical machine learning, two imp...
research
02/08/2022

Practical Challenges in Differentially-Private Federated Survival Analysis of Medical Data

Survival analysis or time-to-event analysis aims to model and predict th...
research
11/21/2022

DPD-fVAE: Synthetic Data Generation Using Federated Variational Autoencoders With Differentially-Private Decoder

Federated learning (FL) is getting increased attention for processing se...
research
06/15/2020

GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators

The wide-spread availability of rich data has fueled the growth of machi...

Please sign up or login with your details

Forgot password? Click here to reset