Privacy-Preserving Multiparty Learning For Logistic Regression

10/04/2018
by   Wei Du, et al.
0

In recent years, machine learning techniques are widely used in numerous applications, such as weather forecast, financial data analysis, spam filtering, and medical prediction. In the meantime, massive data generated from multiple sources further improve the performance of machine learning tools. However, data sharing from multiple sources brings privacy issues for those sources since sensitive information may be leaked in this process. In this paper, we propose a framework enabling multiple parties to collaboratively and accurately train a learning model over distributed datasets while guaranteeing the privacy of data sources. Specifically, we consider logistic regression model for data training and propose two approaches for perturbing the objective function to preserve ϵ-differential privacy. The proposed solutions are tested on real datasets, including Bank Marketing and Credit Card Default prediction. Experimental results demonstrate that the proposed multiparty learning framework is highly efficient and accurate.

READ FULL TEXT
research
05/14/2021

Privacy-preserving Logistic Regression with Secret Sharing

Logistic regression (LR) is a widely used classification method for mode...
research
09/18/2023

Online Efficient Secure Logistic Regression based on Function Secret Sharing

Logistic regression is an algorithm widely used for binary classificatio...
research
07/17/2018

Efficient Deep Learning on Multi-Source Private Data

Machine learning models benefit from large and diverse datasets. Using s...
research
09/12/2023

Privacy-Preserving Linkage of Distributed Datasets using the Personal Health Train

With the generation of personal and medical data at several locations, m...
research
01/15/2023

A Coreset Learning Reality Check

Subsampling algorithms are a natural approach to reduce data size before...
research
11/03/2020

A Scalable Approach for Privacy-Preserving Collaborative Machine Learning

We consider a collaborative learning scenario in which multiple data-own...
research
02/11/2019

Drynx: Decentralized, Secure, Verifiable System for Statistical Queries and Machine Learning on Distributed Datasets

Data sharing has become of primary importance in many domains such as bi...

Please sign up or login with your details

Forgot password? Click here to reset