ProML: A Decentralised Platform for Provenance Management of Machine Learning Software Systems

06/21/2022
by   Nguyen Khoi Tran, et al.
0

Large-scale Machine Learning (ML) based Software Systems are increasingly developed by distributed teams situated in different trust domains. Insider threats can launch attacks from any domain to compromise ML assets (models and datasets). Therefore, practitioners require information about how and by whom ML assets were developed to assess their quality attributes such as security, safety, and fairness. Unfortunately, it is challenging for ML teams to access and reconstruct such historical information of ML assets (ML provenance) because it is generally fragmented across distributed ML teams and threatened by the same adversaries that attack ML assets. This paper proposes ProML, a decentralised platform that leverages blockchain and smart contracts to empower distributed ML teams to jointly manage a single source of truth about circulated ML assets' provenance without relying on a third party, which is vulnerable to insider threats and presents a single point of failure. We propose a novel architectural approach called Artefact-as-a-State-Machine to leverage blockchain transactions and smart contracts for managing ML provenance information and introduce a user-driven provenance capturing mechanism to integrate existing scripts and tools to ProML without compromising participants' control over their assets and toolchains. We evaluate the performance and overheads of ProML by benchmarking a proof-of-concept system on a global blockchain. Furthermore, we assessed ProML's security against a threat model of a distributed ML workflow.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/01/2020

EVMPatch: Timely and Automated Patching of Ethereum Smart Contracts

Recent attacks exploiting errors in smart contract code had devastating ...
research
05/23/2019

Modeling Smart Contracts Activities: A Tensor Based Approach

Smart contracts are autonomous software executing predefined conditions....
research
05/04/2020

Formal Verification of Solidity contracts in Event-B

Smart contracts are the artifact of the blockchain that provide immutabl...
research
06/19/2023

Blockchain-Enabled Federated Learning: A Reference Architecture Incorporating a DID Access System

Recently, Blockchain-Enabled Federated Learning (BCFL), an innovative ap...
research
04/25/2019

d-MABE: Distributed Multilevel Attribute-Based EMR Management and Applications

Current systems used by medical institutions for the management and tran...
research
03/17/2021

Extending SOUP to ML Models When DesigningCertified Medical Systems

Software of Unknown Provenance, SOUP, refers to a software component tha...
research
04/17/2023

AICons: An AI-Enabled Consensus Algorithm Driven by Energy Preservation and Fairness

Blockchain has been used in several domains. However, this technology st...

Please sign up or login with your details

Forgot password? Click here to reset