DPpack: An R Package for Differentially Private Statistical Analysis and Machine Learning

09/19/2023
by   Spencer Giddens, et al.
0

Differential privacy (DP) is the state-of-the-art framework for guaranteeing privacy for individuals when releasing aggregated statistics or building statistical/machine learning models from data. We develop the open-source R package DPpack that provides a large toolkit of differentially private analysis. The current version of DPpack implements three popular mechanisms for ensuring DP: Laplace, Gaussian, and exponential. Beyond that, DPpack provides a large toolkit of easily accessible privacy-preserving descriptive statistics functions. These include mean, variance, covariance, and quantiles, as well as histograms and contingency tables. Finally, DPpack provides user-friendly implementation of privacy-preserving versions of logistic regression, SVM, and linear regression, as well as differentially private hyperparameter tuning for each of these models. This extensive collection of implemented differentially private statistics and models permits hassle-free utilization of differential privacy principles in commonly performed statistical analysis. We plan to continue developing DPpack and make it more comprehensive by including more differentially private machine learning techniques, statistical modeling and inference in the future.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2016

Differentially Private Gaussian Processes

A major challenge for machine learning is increasing the availability of...
research
11/28/2022

Differentially Private Multivariate Statistics with an Application to Contingency Table Analysis

Differential privacy (DP) has become a rigorous central concept in priva...
research
08/01/2023

Differentially Private Linear Regression with Linked Data

There has been increasing demand for establishing privacy-preserving met...
research
06/30/2023

Differential Privacy May Have a Potential Optimization Effect on Some Swarm Intelligence Algorithms besides Privacy-preserving

Differential privacy (DP), as a promising privacy-preserving model, has ...
research
10/22/2021

A Feasibility Study of Differentially Private Summary Statistics and Regression Analyses for Administrative Tax Data

Federal administrative tax data are invaluable for research, but because...
research
12/30/2019

Differentially Private M-band Wavelet-Based Mechanisms in Machine Learning Environments

In the post-industrial world, data science and analytics have gained par...
research
03/18/2023

The Challenge of Differentially Private Screening Rules

Linear L_1-regularized models have remained one of the simplest and most...

Please sign up or login with your details

Forgot password? Click here to reset