Sharing in a Trustless World: Privacy-Preserving Data Analytics with Potentially Cheating Participants

06/18/2021
by   Tham Nguyen, et al.
0

Lack of trust between organisations and privacy concerns about their data are impediments to an otherwise potentially symbiotic joint data analysis. We propose DataRing, a data sharing system that allows mutually mistrusting participants to query each others' datasets in a privacy-preserving manner while ensuring the correctness of input datasets and query answers even in the presence of (cheating) participants deviating from their true datasets. By relying on the assumption that if only a small subset of rows of the true dataset are known, participants cannot submit answers to queries deviating significantly from their true datasets. We employ differential privacy and a suite of cryptographic tools to ensure individual privacy for each participant's dataset and data confidentiality from the system. Our results show that the evaluation of 10 queries on a dataset with 10 attributes and 500,000 records is achieved in 90.63 seconds. DataRing could detect cheating participant that deviates from its true dataset in few queries with high accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2021

GenShare: Sharing Accurate Differentially-Private Statistics for Genomic Datasets with Dependent Tuples

Motivation: Cutting the cost of DNA sequencing technology led to a quant...
research
01/06/2018

Privacy-Preserving Aggregate Queries for Optimal Location Selection

Today, vast amounts of location data are collected by various service pr...
research
12/25/2018

Privacy-Preserving Collaborative Deep Learning with Irregular Participants

With large amounts of data collected from massive sensors, mobile users ...
research
11/13/2022

Comprehension from Chaos: What Users Understand and Expect from Private Computation

Private computation, which includes techniques like multi-party computat...
research
12/01/2017

Together or Alone: The Price of Privacy in Joint Learning

Machine Learning is a widely-used method for prediction generation. Thes...
research
02/11/2019

Drynx: Decentralized, Secure, Verifiable System for Statistical Queries and Machine Learning on Distributed Datasets

Data sharing has become of primary importance in many domains such as bi...
research
07/05/2020

Octopus: Privacy-Preserving Collaborative Evaluation of Loan Stacking

With the rise of online lenders, the loan stacking problem has become a ...

Please sign up or login with your details

Forgot password? Click here to reset