Datashare: A Decentralized Privacy-Preserving Search Engine for Investigative Journalists

by Kasra EdalatNejad, et al.

Investigative journalists collect large numbers of digital documents during their investigations. These documents could greatly benefit other journalists' work. However, many of these documents contain sensitive information, and possessing them can endanger reporters, their stories, and their sources. Thus, many documents are used only for single, local investigations. We present Datashare, a decentralized and privacy-preserving global search system that enables journalists worldwide to find documents via a dedicated network of peers. Datashare combines well-known anonymous authentication mechanisms and anonymous communication primitives, a novel asynchronous messaging system, and a novel multi-set private set intersection protocol (MS-PSI) into a decentralized peer-to-peer private document search engine. Using a prototype implementation, we show that Datashare is secure and scales to thousands of users and millions of documents.



Privacy-Preserving Multi-Document Summarization

State-of-the-art extractive multi-document summarization systems are usu...

Safepaths: Vaccine Diary Protocol and Decentralized Vaccine Coordination System using a Privacy Preserving User Centric Experience

In this early draft, we present an end-to-end decentralized protocol for...

Privacy-preserving record linkage using local sensitive hash and private set intersection

The amount of data stored in data repositories increases every year. Thi...

Biscotti: A Ledger for Private and Secure Peer-to-Peer Machine Learning

Centralized solutions for privacy-preserving multi-party ML are becoming...

Strategies and Perceived Risks of Sending Sensitive Documents

People are frequently required to send documents, forms, or other materi...

An Effective Privacy-Preserving Data Coding in Peer-To-Peer Network

Coding Opportunistically (COPE) is a simple but very effective data codi...

Secure Friend Discovery via Privacy-Preserving and Decentralized Community Detection

The problem of secure friend discovery on a social network has long been...