An Evaluation of Open-source Tools for the Provision of Differential Privacy

02/19/2022
by   Shiliang Zhang, et al.
0

The concept of differential privacy has widely penetrated academia and industry, with its formal guarantee on individual privacy that leads to compliances with privacy legislation, e.g., GDPR. However, there is a lack of understanding on tools capable of achieving differential privacy, and it is not clear what to expect from existing differential privacy tools when implementing privacy protection. Such an obstacle limits private applications' further prosperity. This paper reviews and evaluates the state-of-the-art open-source differential privacy tools of different domains using various estimating categories and privacy settings. Particularly, we look into the performances of three differential privacy tools for machine learning, two for statistical query, and four for synthetic data generation. We test all the tools on both continuous and categorical data and quantify their performance under different privacy budget and data size w.r.t. utility loss and system overhead. The accumulated evaluation results reveal several patterns that users can follow to optimally configure the tools, and provide preliminary guidelines on tool selection under different criteria. Finally, we openly release our evaluation coding repository, a framework that users can reuse to further evaluate the studied tools and beyond. We anticipate this work to provide a comprehensive insight into the performances of the existing dominant privacy tools, and a concrete reference for a potentially large developer community on private applications, thus narrowing the gap between conceptual differential privacy and private functionality development.

READ FULL TEXT

page 19

page 22

page 24

page 27

page 29

page 34

page 38

page 42

research
11/07/2022

Lessons Learned: Surveying the Practicality of Differential Privacy in the Industry

Since its introduction in 2006, differential privacy has emerged as a pr...
research
09/22/2018

Understanding Tor Usage with Privacy-Preserving Measurement

The Tor anonymity network is difficult to measure because, if not done c...
research
01/27/2021

Randori: Local Differential Privacy for All

Polls are a common way of collecting data, including product reviews and...
research
02/17/2021

Differential Privacy for Government Agencies – Are We There Yet?

Government agencies always need to carefully consider potential risks of...
research
06/15/2020

Transparent Privacy is Principled Privacy

Differential privacy revolutionizes the way we think about statistical d...
research
04/20/2022

Private measures, random walks, and synthetic data

Differential privacy is a mathematical concept that provides an informat...
research
09/22/2021

Do I Get the Privacy I Need? Benchmarking Utility in Differential Privacy Libraries

An increasing number of open-source libraries promise to bring different...

Please sign up or login with your details

Forgot password? Click here to reset