PRIMO: Private Regression in Multiple Outcomes

03/07/2023
by   Seth Neel, et al.
0

We introduce a new differentially private regression setting we call Private Regression in Multiple Outcomes (PRIMO), inspired the common situation where a data analyst wants to perform a set of l regressions while preserving privacy, where the covariates X are shared across all l regressions, and each regression i ∈ [l] has a different vector of outcomes y_i. While naively applying private linear regression techniques l times leads to a √(l) multiplicative increase in error over the standard linear regression setting, in Subsection 4.1 we modify techniques based on sufficient statistics perturbation (SSP) to yield greatly improved dependence on l. In Subsection 4.2 we prove an equivalence to the problem of privately releasing the answers to a special class of low-sensitivity queries we call inner product queries. Via this equivalence, we adapt the geometric projection-based methods from prior work on private query release to the PRIMO setting. Under the assumption the labels Y are public, the projection gives improved results over the Gaussian mechanism when n < l√(d), with no asymptotic dependence on l in the error. In Subsection 4.3 we study the complexity of our projection algorithm, and analyze a faster sub-sampling based variant in Subsection 4.4. Finally in Section 5 we apply our algorithms to the task of private genomic risk prediction for multiple phenotypes using data from the 1000 Genomes project. We find that for moderately large values of l our techniques drastically improve the accuracy relative to both the naive baseline that uses existing private regression methods and our modified SSP algorithm that doesn't use the projection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2019

Differentially Private Bayesian Linear Regression

Linear regression is an important tool across many fields that work with...
research
02/19/2022

Differentially Private Regression with Unbounded Covariates

We provide computationally efficient, differentially private algorithms ...
research
03/11/2021

Differentially Private Query Release Through Adaptive Projection

We propose, implement, and evaluate a new algorithm for releasing answer...
research
06/16/2022

Differentially Private Multi-Party Data Release for Linear Regression

Differentially Private (DP) data release is a promising technique to dis...
research
03/06/2023

Improved Differentially Private Regression via Gradient Boosting

We revisit the problem of differentially private squared error linear re...
research
08/01/2023

Differentially Private Linear Regression with Linked Data

There has been increasing demand for establishing privacy-preserving met...
research
09/16/2022

Truthful Generalized Linear Models

In this paper we study estimating Generalized Linear Models (GLMs) in th...

Please sign up or login with your details

Forgot password? Click here to reset