Distributionally-Robust Machine Learning Using Locally Differentially-Private Data

06/24/2020
by   Farhad Farokhi, et al.
12

We consider machine learning, particularly regression, using locally-differentially private datasets. The Wasserstein distance is used to define an ambiguity set centered at the empirical distribution of the dataset corrupted by local differential privacy noise. The ambiguity set is shown to contain the probability distribution of unperturbed, clean data. The radius of the ambiguity set is a function of the privacy budget, spread of the data, and the size of the problem. Hence, machine learning with locally-differentially private datasets can be rewritten as a distributionally-robust optimization. For general distributions, the distributionally-robust optimization problem can relaxed as a regularized machine learning problem with the Lipschitz constant of the machine learning model as a regularizer. For linear and logistic regression, this regularizer is the dual norm of the model parameters. For Gaussian data, the distributionally-robust optimization problem can be solved exactly to find an optimal regularizer. This approach results in an entirely new regularizer for training linear regression models. Training with this novel regularizer can be posed as a semi-definite program. Finally, the performance of the proposed distributionally-robust machine learning training is demonstrated on practical datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2020

Regularization Helps with Mitigating Poisoning Attacks: Distributionally-Robust Machine Learning Using the Wasserstein Distance

We use distributionally-robust optimization for machine learning to miti...
research
06/10/2020

Robustified Multivariate Regression and Classification Using Distributionally Robust Optimization under the Wasserstein Metric

We develop Distributionally Robust Optimization (DRO) formulations for M...
research
04/16/2020

Differentially Private Linear Regression over Fully Decentralized Datasets

This paper presents a differentially private algorithm for linear regres...
research
10/15/2022

Distributionally Robust Multiclass Classification and Applications in Deep Image Classifiers

We develop a Distributionally Robust Optimization (DRO) formulation for ...
research
09/27/2021

Distributionally Robust Multiclass Classification and Applications in Deep CNN Image Classifiers

We develop a Distributionally Robust Optimization (DRO) formulation for ...
research
10/05/2022

Learning from aggregated data with a maximum entropy model

Aggregating a dataset, then injecting some noise, is a simple and common...
research
01/14/2020

Private Machine Learning via Randomised Response

We introduce a general learning framework for private machine learning b...

Please sign up or login with your details

Forgot password? Click here to reset