Differentially Private ERM Based on Data Perturbation

02/20/2020
by   Yilin Kang, et al.
7

In this paper, after observing that different training data instances affect the machine learning model to different extents, we attempt to improve the performance of differentially private empirical risk minimization (DP-ERM) from a new perspective. Specifically, we measure the contributions of various training data instances on the final machine learning model, and select some of them to add random noise. Considering that the key of our method is to measure each data instance separately, we propose a new `Data perturbation' based (DB) paradigm for DP-ERM: adding random noise to the original training data and achieving (ϵ,δ)-differential privacy on the final machine learning model, along with the preservation on the original data. By introducing the Influence Function (IF), we quantitatively measure the impact of the training data on the final model. Theoretical and experimental results show that our proposed DBDP-ERM paradigm enhances the model performance significantly.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2020

Input Perturbation: A New Paradigm between Central and Local Differential Privacy

Traditionally, there are two models on differential privacy: the central...
research
08/22/2020

On the Intrinsic Differential Privacy of Bagging

Differentially private machine learning trains models while protecting p...
research
03/02/2021

DP-InstaHide: Provably Defusing Poisoning and Backdoor Attacks with Differentially Private Data Augmentations

Data poisoning and backdoor attacks manipulate training data to induce s...
research
10/04/2022

Recycling Scraps: Improving Private Learning by Leveraging Intermediate Checkpoints

All state-of-the-art (SOTA) differentially private machine learning (DP ...
research
01/03/2023

Differentially Private Federated Clustering over Non-IID Data

Federated clustering (FedC) is an adaptation of centralized clustering i...
research
02/21/2023

Valid Inference for Machine Learning Model Parameters

The parameters of a machine learning model are typically learned by mini...

Please sign up or login with your details

Forgot password? Click here to reset