Interval Privacy: A Framework for Data Collection

06/17/2021
by   Jie Ding, et al.
0

The emerging public awareness and government regulations of data privacy motivate new paradigms of collecting and analyzing data transparent and acceptable to data owners. We present a new concept of privacy and corresponding data formats, mechanisms, and tradeoffs for privatizing data during data collection. The privacy, named Interval Privacy, enforces the raw data conditional distribution on the privatized data to be the same as its unconditional distribution over a nontrivial support set. Correspondingly, the proposed privacy mechanism will record each data value as a random interval containing it. The proposed interval privacy mechanisms can be easily deployed through most existing survey-based data collection paradigms, e.g., by asking a respondent whether its data value is within a randomly generated range. Another unique feature of interval mechanisms is that they obfuscate the truth but not distort it. The way of using narrowed range to convey information is complementary to the popular paradigm of perturbing data. Also, the interval mechanisms can generate progressively refined information at the discretion of individual respondents. We study different theoretical aspects of the proposed privacy. In the context of supervised learning, we also offer a method such that existing supervised learning algorithms designed for point-valued data could be directly applied to learning from interval-valued data.

READ FULL TEXT

page 20

page 21

research
02/24/2016

Discrete Distribution Estimation under Local Privacy

The collection and analysis of user data drives improvements in the app ...
research
03/10/2022

Facilitating Federated Genomic Data Analysis by Identifying Record Correlations while Ensuring Privacy

With the reduction of sequencing costs and the pervasiveness of computin...
research
04/06/2020

Can Two Walk Together: Privacy Enhancing Methods and Preventing Tracking of Users

We present a new concern when collecting data from individuals that aris...
research
07/18/2023

Trajectory Data Collection with Local Differential Privacy

Trajectory data collection is a common task with many applications in ou...
research
12/05/2017

Collecting Telemetry Data Privately

The collection and analysis of telemetry data from users' devices is rou...
research
02/17/2023

More Data Types More Problems: A Temporal Analysis of Complexity, Stability, and Sensitivity in Privacy Policies

Collecting personally identifiable information (PII) on data subjects ha...
research
07/02/2021

Subset Privacy: Draw from an Obfuscated Urn

With the rapidly increasing ability to collect and analyze personal data...

Please sign up or login with your details

Forgot password? Click here to reset