Private data sharing between decentralized users through the privGAN architecture

09/14/2020
by   Jean-Francois Rajotte, et al.
0

More data is almost always beneficial for analysis and machine learning tasks. In many realistic situations however, an enterprise cannot share its data, either to keep a competitive advantage or to protect the privacy of the data sources, the enterprise's clients for example. We propose a method for data owners to share synthetic or fake versions of their data without sharing the actual data, nor the parameters of models that have direct access to the data. The method proposed is based on the privGAN architecture where local GANs are trained on their respective data subsets with an extra penalty from a central discriminator aiming to discriminate the origin of a given fake sample. We demonstrate that this approach, when applied to subsets of various sizes, leads to better utility for the owners than the utility from their real small datasets. The only shared pieces of information are the parameter updates of the central discriminator. The privacy is demonstrated with white-box attacks on the most vulnerable elments of the architecture and the results are close to random guessing. This method would apply naturally in a federated learning setting.

READ FULL TEXT
research
09/24/2021

A Generative Federated Learning Framework for Differential Privacy

In machine learning, differential privacy and federated learning concept...
research
01/18/2021

Reducing bias and increasing utility by federated generative modeling of medical images using a centralized adversary

We introduce FELICIA (FEderated LearnIng with a CentralIzed Adversary) a...
research
10/20/2020

Mitigating Sybil Attacks on Differential Privacy based Federated Learning

In federated learning, machine learning and deep learning models are tra...
research
02/03/2023

GTV: Generating Tabular Data via Vertical Federated Learning

Generative Adversarial Networks (GANs) have achieved state-of-the-art re...
research
05/06/2021

Membership Inference Attacks on Deep Regression Models for Neuroimaging

Ensuring the privacy of research participants is vital, even more so in ...
research
08/21/2018

MobilityMirror: Bias-Adjusted Transportation Datasets

We describe customized synthetic datasets for publishing mobility data. ...
research
04/14/2021

The Role of Cross-Silo Federated Learning in Facilitating Data Sharing in the Agri-Food Sector

Data sharing remains a major hindering factor when it comes to adopting ...

Please sign up or login with your details

Forgot password? Click here to reset