Increasing Adversarial Uncertainty to Scale Private Similarity Testing

09/03/2021
by   Yiqing Hua, et al.

Social media and other platforms rely on automated detection of abusive content to help combat disinformation, harassment, and abuse. One common approach is to check user content for similarity against a server-side database of problematic items. However, this method fundamentally endangers user privacy. Instead, we target client-side detection, notifying only the users when such matches occur to warn them against abusive content. Our solution is based on privacy-preserving similarity testing. Existing approaches rely on expensive cryptographic protocols that do not scale well to large databases and may sacrifice the correctness of the matching. To contend with this challenge, we propose and formalize the concept of similarity-based bucketization (SBB). With SBB, a client reveals a small amount of information to a database-holding server so that the server can generate a bucket of potentially similar items. The bucket is small enough for efficient application of privacy-preserving protocols for similarity. To analyze the privacy risk of the revealed information, we introduce a framework for measuring an adversary's ability to infer a predicate about the client input with good confidence. We develop a practical SBB protocol for image content, and evaluate its client privacy guarantee with real-world social media data. We then combine SBB with various similarity protocols, showing that SBB provides a speedup of at least 29x on large-scale databases, while retaining correctness of over 95%.
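To make the bucketization idea concrete, the following is a minimal sketch of the general flow and not the protocol from the paper. It assumes, purely for illustration, that each image is represented by a 64-bit similarity hash, that the client reveals only the hash bits at a few sampled positions, and that the server returns every database item whose bits at those positions differ in at most `flip_tolerance` places. The names `coarse_embedding`, `build_bucket`, and `client_check`, and all parameter values, are hypothetical. The final comparison is done in plaintext here only as a stand-in for the privacy-preserving similarity protocol that would be run over the bucket.

```python
import random
from typing import List

HASH_BITS = 64  # assumed size of the similarity hash (illustrative)


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which a and b differ."""
    return bin(a ^ b).count("1")


def coarse_embedding(h: int, positions: List[int]) -> int:
    """Pack the bits of h at the given positions into a small integer.

    This is the only information the client reveals in this sketch.
    """
    out = 0
    for i, p in enumerate(positions):
        out |= ((h >> p) & 1) << i
    return out


def build_bucket(database: List[int], positions: List[int],
                 revealed: int, flip_tolerance: int) -> List[int]:
    """Server side: keep items whose coarse embedding is close to the revealed one."""
    return [
        item for item in database
        if hamming(coarse_embedding(item, positions), revealed) <= flip_tolerance
    ]


def client_check(client_hash: int, bucket: List[int], threshold: int) -> bool:
    """Plaintext stand-in for a privacy-preserving similarity test over the bucket."""
    return any(hamming(client_hash, item) <= threshold for item in bucket)


# Illustrative run with synthetic hashes (no real data).
rng = random.Random(0)
database = [rng.getrandbits(HASH_BITS) for _ in range(1000)]
positions = rng.sample(range(HASH_BITS), 8)   # which bits the client reveals
client_hash = database[0] ^ 0b101             # near-duplicate: 2 bits flipped

revealed = coarse_embedding(client_hash, positions)
bucket = build_bucket(database, positions, revealed, flip_tolerance=2)
match = client_check(client_hash, bucket, threshold=4)
print(len(bucket), "of", len(database), "items in bucket; match:", match)
```

In this sketch the expensive similarity test touches only the bucket rather than the whole database, which is where a speedup would come from. Revealing more bits shrinks the bucket but tells the server more about the client input, which is the trade-off the paper's privacy framework is meant to quantify.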


research
06/14/2022

Private Set Matching Protocols

We introduce Private Set Matching (PSM) problems, in which a client aims...
research
08/30/2018

VirtualIdentity: Privacy-Preserving User Profiling

User profiling from user generated content (UGC) is a common practice th...
research
09/16/2021

PrivateFetch: Scalable Catalog Delivery in Privacy-Preserving Advertising

In order to preserve the possibility of an Internet that is free at the ...
research
01/22/2021

Understanding the Tradeoffs in Client-Side Privacy for Speech Recognition

Existing approaches to ensuring privacy of user speech data primarily fo...
research
06/07/2018

Privacy-Preserving Identification via Layered Sparse Code Design: Distributed Servers and Multiple Access Authorization

We propose a new computationally efficient privacy-preserving identifica...
research
05/23/2023

Mixup-Privacy: A simple yet effective approach for privacy-preserving segmentation

Privacy protection in medical data is a legitimate obstacle for centrali...
research
04/06/2023

Robust, privacy-preserving, transparent, and auditable on-device blocklisting

With the accelerated adoption of end-to-end encryption, there is an oppo...
