The Challenge of Differentially Private Screening Rules

03/18/2023
by   Amol Khanna, et al.
0

Linear L_1-regularized models have remained one of the simplest and most effective tools in data analysis, especially in information retrieval problems where n-grams over text with TF-IDF or Okapi feature values are a strong and easy baseline. Over the past decade, screening rules have risen in popularity as a way to reduce the runtime for producing the sparse regression weights of L_1 models. However, despite the increasing need of privacy-preserving models in information retrieval, to the best of our knoweledge, no differentially private screening rule exists. In this paper, we develop the first differentially private screening rule for linear and logistic regression. In doing so, we discover difficulties in the task of making a useful private screening rule due to the amount of noise added to ensure privacy. We provide theoretical arguments and experimental evidence that this difficulty arises from the screening step itself and not the private optimizer. Based on our results, we highlight that developing an effective private L_1 screening method is an open problem in the differential privacy literature.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2022

Differentially Private Tree-Based Redescription Mining

Differential privacy provides a strong form of privacy and allows preser...
research
09/19/2023

DPpack: An R Package for Differentially Private Statistical Analysis and Machine Learning

Differential privacy (DP) is the state-of-the-art framework for guarante...
research
08/05/2021

Differentially Private n-gram Extraction

We revisit the problem of n-gram extraction in the differential privacy ...
research
07/30/2014

Differentially-Private Logistic Regression for Detecting Multiple-SNP Association in GWAS Databases

Following the publication of an attack on genome-wide association studie...
research
04/27/2021

The Hessian Screening Rule

Predictor screening rules, which discard predictors from the design matr...
research
04/10/2019

What Storage Access Privacy is Achievable with Small Overhead?

Oblivious RAM (ORAM) and private information retrieval (PIR) are classic...
research
03/22/2010

Development of a Cargo Screening Process Simulator: A First Approach

The efficiency of current cargo screening processes at sea and air ports...

Please sign up or login with your details

Forgot password? Click here to reset