Continual and Sliding Window Release for Private Empirical Risk Minimization

03/07/2022
by Lauren Watson, et al.

Continually updating private machine learning models with new data while maintaining privacy is difficult. Data incur increasing privacy loss, as measured by differential privacy, when they are used in repeated computations. In this paper, we describe regularized empirical risk minimization algorithms that continually release models for a recent window of data. One version of the algorithm uses the entire data history to improve the model for the recent window. The second version uses a sliding window of constant size to improve the model, which keeps the released models relevant when the data evolve. The algorithms operate in the framework of stochastic gradient descent. We prove that even when a model is released at each time step over an infinite time horizon, the privacy cost of any data point is bounded by a constant ϵ in the sense of differential privacy, and the accuracy of the output models is close to optimal. Experiments on MNIST and arXiv publications data show results consistent with the theory.
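
To make the sliding-window idea concrete, the following is a minimal illustrative sketch, not the paper's algorithm. It keeps a window of constant size, runs a few steps of clipped, noise-perturbed stochastic gradient descent on an L2-regularized logistic loss over that window, and releases a model at every time step. All names (`sliding_window_release`, `noisy_sgd`) and all parameter values are assumptions made for illustration, as is the choice to warm-start each run from the previous model. In particular, `noise_std` is not calibrated to any ϵ, so this sketch carries no privacy guarantee.

```python
import math
import random
from collections import deque


def clip(vec, max_norm):
    """Rescale vec so that its Euclidean norm is at most max_norm."""
    norm = math.sqrt(sum(v * v for v in vec))
    if norm <= max_norm or norm == 0.0:
        return list(vec)
    scale = max_norm / norm
    return [v * scale for v in vec]


def logistic_grad(w, x, y):
    """Gradient of log(1 + exp(-y * <w, x>)) in w, for a label y in {-1, +1}."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    # Numerically stable evaluation of sigmoid(-margin).
    if margin >= 0:
        e = math.exp(-margin)
        s = e / (1.0 + e)
    else:
        s = 1.0 / (1.0 + math.exp(margin))
    return [-y * s * xi for xi in x]


def noisy_sgd(window, w, rng, steps=50, lr=0.1, lam=0.1,
              clip_norm=1.0, noise_std=0.05):
    """Run clipped SGD with Gaussian noise on the regularized loss over window.

    noise_std is an illustrative value, NOT derived from a privacy budget.
    """
    for _ in range(steps):
        x, y = rng.choice(window)
        g = clip(logistic_grad(w, x, y), clip_norm)
        w = [wi - lr * (gi + lam * wi + rng.gauss(0.0, noise_std))
             for wi, gi in zip(w, g)]
    return w


def sliding_window_release(stream, dim, window_size=100, seed=0, **sgd_kwargs):
    """Yield one model per arriving data point, trained on the recent window."""
    rng = random.Random(seed)
    window = deque(maxlen=window_size)  # the oldest point is dropped automatically
    w = [0.0] * dim
    for x, y in stream:
        window.append((x, y))
        # Assumption: warm-start from the previously released model.
        w = noisy_sgd(list(window), w, rng, **sgd_kwargs)
        yield list(w)
```

A call such as `list(sliding_window_release(stream, dim=2, window_size=10, steps=5))`, where `stream` is an iterable of `(features, label)` pairs, returns one released weight vector per data point. A variant that uses the entire data history would keep every point instead of bounding the window with `maxlen`.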

research
06/25/2020

Stability Enhanced Privacy and Applications in Private Stochastic Gradient Descent

Private machine learning involves addition of noise while training, resu...
research
05/05/2023

Differentially-private Continual Releases against Dynamic Databases

Prior research primarily examined differentially-private continual relea...
research
07/11/2019

Amplifying Rényi Differential Privacy via Shuffling

Differential privacy is a useful tool to build machine learning models w...
research
09/18/2019

Renyi Differentially Private ADMM Based L1 Regularized Classification

In this paper we present two new algorithms, to solve the L1 regularized...
research
01/18/2021

On the Differentially Private Nature of Perturbed Gradient Descent

We consider the problem of empirical risk minimization given a database,...
research
01/22/2021

The Privacy-Utility Tradeoff of Robust Local Differential Privacy

We consider data release protocols for data X=(S,U), where S is sensitiv...
research
11/21/2019

S-RASTER: Contraction Clustering for Evolving Data Streams

Contraction Clustering (RASTER) is a very fast algorithm for density-bas...
