Privacy-preserving record linkage using local sensitive hash and private set intersection

03/27/2022
by   Allon Adir, et al.
0

The amount of data stored in data repositories increases every year. This makes it challenging to link records between different datasets across companies and even internally, while adhering to privacy regulations. Address or name changes, and even different spelling used for entity data, can prevent companies from using private deduplication or record-linking solutions such as private set intersection (PSI). To this end, we propose a new and efficient privacy-preserving record linkage (PPRL) protocol that combines PSI and local sensitive hash (LSH) functions, and runs in linear time. We explain the privacy guarantees that our protocol provides and demonstrate its practicality by executing the protocol over two datasets with 2^20 records each, in 11-45 minutes, depending on network settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2022

Privacy-Preserving Record Linkage

Given several databases containing person-specific data held by differen...
research
11/18/2020

Asymmetric Private Set Intersection with Applications to Contact Tracing and Private Vertical Federated Machine Learning

We present a multi-language, cross-platform, open-source library for asy...
research
02/22/2018

Options for encoding names for data linking at the Australian Bureau of Statistics

Publicly, ABS has said it would use a cryptographic hash function to con...
research
05/29/2020

Datashare: A Decentralized Privacy-Preserving Search Engine for Investigative Journalists

Investigative journalists collect large numbers of digital documents dur...
research
06/14/2023

Privacy-Preserving Password Cracking: How a Third Party Can Crack Our Password Hash Without Learning the Hash Value or the Cleartext

Using the computational resources of an untrusted third party to crack a...
research
08/08/2023

The Still Secret Ballot: The Limited Privacy Cost of Transparent Election Results

After an election, should election officials release an electronic recor...
research
08/07/2023

Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution

The entity resolution problem requires finding pairs across datasets tha...

Please sign up or login with your details

Forgot password? Click here to reset