Efficient logging and querying for Blockchain-based cross-site genomic dataset access audit

07/17/2019
by Shuaicheng Ma, et al.

Background: Genomic data have been collected by different institutions and companies and need to be shared for broader use. In a cross-site genomic data sharing system, a secure and transparent access control audit module plays an essential role in ensuring accountability. The first track of the 2018 iDASH competition gave us an opportunity to design an efficient logging and querying system for cross-site genomic dataset access audit. We designed a blockchain-based log system that provides a lightweight and widely compatible module for existing blockchain platforms. The submitted solution won third place in the competition. In this paper, we report the technical details of our system.

Methods: We present two methods: a baseline method and an enhanced method. We started with the baseline method and then adjusted our implementation based on the competition evaluation criteria and the characteristics of the log system. To overcome the obstacles to indexing on an immutable Blockchain system, we designed a hierarchical timestamp structure that supports efficient range queries on the timestamp field.

Results: We implemented our methods in Python3, tested their scalability, and compared their performance using the test data supplied by the competition organizer. We boosted the log retrieval speed for complex AND queries that contain multiple predicates. For range queries, we boosted the speed by at least one order of magnitude. The storage usage is reduced by 25%.

Conclusion: We demonstrate that Blockchain can be used to build a time- and space-efficient logging and querying system for genomic dataset audit trails. It therefore provides a promising solution for sharing genomic data with accountability requirements across multiple sites.
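The abstract does not spell out the hierarchical timestamp structure, so the following is only a minimal sketch of one way such an index could work, not the authors' implementation. It assumes integer timestamps in seconds, an append-only key-value store (an in-memory dictionary stands in for the blockchain here), and hypothetical bucket widths of one day, one hour, one minute, and one second. The class name `HierarchicalTimestampIndex` and its methods are illustrative names. Each record is appended under one bucket per level, and a range query is answered from the coarsest buckets that fit entirely inside the range, descending to finer levels only at the edges.

```python
from collections import defaultdict

# Hypothetical bucket widths in seconds, coarsest first: day, hour, minute, second.
LEVELS = (86400, 3600, 60, 1)


class HierarchicalTimestampIndex:
    """Append-only index; a dict stands in for the immutable ledger (assumption)."""

    def __init__(self):
        # (bucket width, bucket start) -> records appended under that bucket
        self._buckets = defaultdict(list)

    def append(self, ts, record):
        """Store the record once per level; nothing is ever updated or deleted."""
        for width in LEVELS:
            self._buckets[(width, ts - ts % width)].append(record)

    def range_query(self, start, end):
        """Return all records whose timestamp satisfies start <= ts <= end."""
        out = []
        self._collect(start, end + 1, 0, out)  # work on the half-open range
        return out

    def _collect(self, lo, hi, level, out):
        width = LEVELS[level]
        t = lo
        while t < hi:
            bucket = t - t % width
            if bucket == t and bucket + width <= hi:
                # Bucket lies fully inside the range: take it in one lookup.
                out.extend(self._buckets.get((width, bucket), ()))
                t = bucket + width
            else:
                # Bucket only partly overlaps: resolve it at the next finer level.
                nxt = min(bucket + width, hi)
                self._collect(t, nxt, level + 1, out)
                t = nxt


index = HierarchicalTimestampIndex()
for ts in (10, 59, 60, 3599, 3600, 7300, 86400, 90000, 200000):
    index.append(ts, f"r{ts}")
hits = index.range_query(59, 86400)
```

In this sketch the lookup cost of a range query grows with the number of buckets touched rather than with the number of seconds in the range, at the price of writing each record once per level. How the actual system balances query speed against storage is not stated in the abstract.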

research
01/13/2020

Leveraging Blockchain for Immutable Logging and Querying Across Multiple Sites

Blockchain has emerged as a decentralized and distributed framework that...
research
02/10/2020

Medusa: Blockchain Powered Log Storage System

Blockchain is one of the most heavily invested technologies in recent ye...
research
07/10/2017

Dynamic Geodesic Nearest Neighbor Searching in a Simple Polygon

We present an efficient dynamic data structure that supports geodesic ne...
research
07/09/2020

A Consent Model for Blockchain-based Distributed Data Sharing Platforms

In modern healthcare systems, being able to share electronic health reco...
research
01/11/2019

Distributed Access Control with Blockchain

The specification and enforcement of network-wide policies in a single a...
research
02/07/2019

A Korean translation of "PAI Data, Summary of the Project PAI Data Protocol" by Jincheng Du, Dan Fang, Mark Harvilla

The Project PAI Data Protocol ("PAI Data") is a specification that exten...
