Accuracy Amplification in Differentially Private Logistic Regression: A Pre-Training Approach

07/25/2023
by Mohammad Hoseinpour, et al.

Machine learning (ML) models can memorize their training datasets. As a result, training ML models on private datasets can violate the privacy of individuals. Differential privacy (DP) is a rigorous privacy notion for protecting the training datasets underlying ML models. Yet training ML models in a DP framework usually degrades their accuracy. This paper aims to boost the accuracy of a DP-ML model, specifically a logistic regression model, via a pre-training module. In more detail, we first pre-train our model on a public training dataset that raises no privacy concerns. Then, we fine-tune the model on the private dataset via DP logistic regression. Our numerical results show that adding a pre-training module significantly improves the accuracy of DP logistic regression.
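The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say which DP mechanism the authors use, so the code below assumes a DP-SGD-style step (per-example gradient clipping plus Gaussian noise) for the private stage. The synthetic data, the helper names (train_logreg, make_data, accuracy), and all hyperparameters are hypothetical choices made for illustration, and the sketch does not perform any privacy accounting, so it does not report an epsilon.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logreg(X, y, w, epochs, lr, clip=None, noise_multiplier=0.0, rng=None):
    """Full-batch gradient descent on the logistic loss.

    If clip is given (assumed DP-SGD-style step, not necessarily the paper's
    mechanism), each per-example gradient is clipped to L2 norm <= clip and
    Gaussian noise with std noise_multiplier * clip is added to their sum.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    n = X.shape[0]
    for _ in range(epochs):
        # Per-example gradients of the logistic loss, shape (n, d).
        per_example = (sigmoid(X @ w) - y)[:, None] * X
        if clip is not None:
            norms = np.linalg.norm(per_example, axis=1, keepdims=True)
            per_example = per_example / np.maximum(1.0, norms / clip)
            noise = rng.normal(0.0, noise_multiplier * clip, size=w.shape)
            grad = (per_example.sum(axis=0) + noise) / n
        else:
            grad = per_example.mean(axis=0)
        w = w - lr * grad
    return w

def make_data(n, w_true, rng):
    """Hypothetical synthetic data drawn from a logistic model."""
    X = rng.normal(size=(n, w_true.size))
    y = (rng.random(n) < sigmoid(X @ w_true)).astype(float)
    return X, y

def accuracy(X, y, w):
    return float(np.mean((sigmoid(X @ w) >= 0.5) == (y == 1.0)))

rng = np.random.default_rng(0)
w_true = np.array([2.0, -1.0, 0.5])        # assumed ground-truth weights
X_pub, y_pub = make_data(500, w_true, rng)   # stands in for the public dataset
X_priv, y_priv = make_data(200, w_true, rng) # stands in for the private dataset

# Stage 1: non-private pre-training on the public data.
w_pre = train_logreg(X_pub, y_pub, np.zeros(3), epochs=50, lr=0.5)

# Stage 2: noisy, clipped fine-tuning on the private data, starting from w_pre.
w_dp = train_logreg(X_priv, y_priv, w_pre, epochs=10, lr=0.5,
                    clip=1.0, noise_multiplier=1.0, rng=rng)

print("pre-trained accuracy on private data:", accuracy(X_priv, y_priv, w_pre))
print("DP fine-tuned accuracy on private data:", accuracy(X_priv, y_priv, w_dp))
```

The only difference between the two stages is the starting point and whether clipping and noise are applied: starting the private stage from w_pre rather than from zeros is what the pre-training module contributes. A real deployment would also need a privacy accountant to translate the noise multiplier and number of steps into an (epsilon, delta) guarantee.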


research
12/24/2021

DP-UTIL: Comprehensive Utility Analysis of Differential Privacy in Machine Learning

Differential Privacy (DP) has emerged as a rigorous formalism to reason ...
research
12/06/2022

Straggler-Resilient Differentially-Private Decentralized Learning

We consider the straggler problem in decentralized learning over a logic...
research
11/24/2022

Differentially Private Image Classification from Features

Leveraging transfer learning has recently been shown to be an effective ...
research
07/01/2023

Saibot: A Differentially Private Data Search Platform

Recent data search platforms use ML task-based utility measures rather t...
research
05/23/2023

Selective Pre-training for Private Fine-tuning

Suppose we want to train text prediction models in email clients or word...
research
11/06/2021

On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

Desert locust outbreaks threaten the food security of a large part of Af...
research
09/19/2023

Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression

We investigate the problem of performing logistic regression on data col...
