SciChain: Trustworthy Scientific Data Provenance

02/01/2020
by Abdullah-Al Mamun, et al.

The state-of-the-art approach to auditing and reproducing scientific applications on high-performance computing (HPC) systems is a data provenance subsystem. While recent advances in data provenance focus on reducing performance overhead and improving query flexibility, the fidelity of the provenance itself is often overlooked: there is no way to ensure that the provenance data has not been fabricated or falsified. This paper advocates leveraging blockchains to deliver immutable and autonomous data provenance services so that scientific data are trustworthy. The challenges in adopting blockchains for HPC include designing a new blockchain architecture compatible with HPC platforms and, more importantly, a set of new consensus protocols for scientific applications atop blockchains. To this end, we have designed the proof-of-scalable-traceability (POST) protocol and implemented it in a blockchain prototype, SciChain, the very first blockchain system for HPC. We evaluated SciChain by comparing it with multiple state-of-the-art systems. Experimental results showed that SciChain guaranteed trustworthy data while incurring orders of magnitude lower overhead.

research
01/20/2020

BAASH: Enabling Blockchain-as-a-Service on High-Performance Computing Systems

The state-of-the-art approach to manage blockchains is to process blocks...
research
08/02/2023

PROV-IO+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems

Data provenance, or data lineage, describes the life cycle of data. In s...
research
01/10/2023

Exploring the Use of WebAssembly in HPC

Containerization approaches based on namespaces offered by the Linux ker...
research
02/07/2022

ALDER: Unlocking blockchain performance by multiplexing consensus protocols

Most of today's online services (e.g., social networks, search engines, ...
research
09/15/2022

Towards Interoperability of Open and Permissionless Blockchains: A Cross-Chain Query Language

The rise of open and permissionless blockchains has introduced novel pla...
research
08/01/2020

DeACT: Architecture-Aware Virtual Memory Support for Fabric Attached Memory Systems

The exponential growth of data has driven technology providers to develo...
research
06/12/2019

Application-Level Differential Checkpointing for HPC Applications with Dynamic Datasets

High-performance computing (HPC) requires resilience techniques such as ...
