Together or Alone: The Price of Privacy in Collaborative Learning

12/01/2017
by   Balazs Pejo, et al.
0

Machine Learning is a widely-used method for prediction generation. These predictions are more accurate when the model is trained on a larger dataset. On the other hand, the data is usually divided amongst different entities. For privacy reasons, the training can be done locally and then the model can be safely aggregated amongst the participants. However, if there are only two participants in Collaborative Learning, the safe aggregation loses its power since the output of the training already contains much information about the participants. To resolve this issue, they must employ privacy-preserving mechanisms, which inevitably affect the accuracy of the model. In this paper, we model the training process as a two-player game where each player aims to achieve a higher accuracy while preserving its privacy. We introduce the notion of Price of Privacy, a novel approach to measure the effect of privacy protection on the accuracy of the model. We develop a theoretical model for different player types, and we either find or prove the existence of a Nash Equilibrium with some assumptions. Moreover, we confirm these assumptions via a Recommendation Systems use case: for a specific learning algorithm, we apply three privacy-preserving mechanisms on two real-world datasets. Finally, as a complementary work for the designed game, we interpolate the relationship between privacy and accuracy for this use case and present three other methods to approximate it in a real-world scenario.

READ FULL TEXT
research
12/01/2017

Together or Alone: The Price of Privacy in Joint Learning

Machine Learning is a widely-used method for prediction generation. Thes...
research
03/16/2021

SoK: Privacy-Preserving Collaborative Tree-based Model Learning

Tree-based models are among the most efficient machine learning techniqu...
research
12/25/2018

Privacy-Preserving Collaborative Deep Learning with Irregular Participants

With large amounts of data collected from massive sensors, mobile users ...
research
11/21/2018

Privacy-Preserving Collaborative Prediction using Random Forests

We study the problem of privacy-preserving machine learning (PPML) for e...
research
04/13/2021

Fair and Differentially Private Distributed Frequency Estimation

In order to remain competitive, Internet companies collect and analyse u...
research
10/09/2019

Privacy-preserving and yet Robust Collaborative Filtering Recommender as a Service

Collaborative filtering recommenders provide effective personalization s...
research
01/31/2023

Privacy Preserving Ultra-Short-term Wind Power Prediction Based on Secure Multi Party Computation

Mining the spatial and temporal correlation of wind farm output data is ...

Please sign up or login with your details

Forgot password? Click here to reset