Substra: a framework for privacy-preserving, traceable and collaborative Machine Learning

10/25/2019
by   Mathieu N Galtier, et al.
0

Machine learning is promising, but it often needs to process vast amounts of sensitive data which raises concerns about privacy. In this white-paper, we introduce Substra, a distributed framework for privacy-preserving, traceable and collaborative Machine Learning. Substra gathers data providers and algorithm designers into a network of nodes that can train models on demand but under advanced permission regimes. To guarantee data privacy, Substra implements distributed learning: the data never leave their nodes; only algorithms, predictive models and non-sensitive metadata are exchanged on the network. The computations are orchestrated by a Distributed Ledger Technology which guarantees traceability and authenticity of information without needing to trust a third party. Although originally developed for Healthcare applications, Substra is not data, algorithm or programming language specific. It supports many types of computation plans including parallel computation plan commonly used in Federated Learning. With appropriate guidelines, it can be deployed for numerous Machine Learning use-cases with data or algorithm providers where trust is limited.

READ FULL TEXT
research
03/29/2021

Privacy and Trust Redefined in Federated Machine Learning

A common privacy issue in traditional machine learning is that data need...
research
06/03/2020

A Distributed Trust Framework for Privacy-Preserving Machine Learning

When training a machine learning model, it is standard procedure for the...
research
04/11/2018

A Management Framework for Secure Multiparty Computation in Dynamic Environments

Secure multiparty computation (SMC) is a promising technology for privac...
research
05/14/2023

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning

Taxi-demand prediction is an important application of machine learning t...
research
06/05/2023

A Privacy-Preserving Federated Learning Approach for Kernel methods

It is challenging to implement Kernel methods, if the data sources are d...
research
12/21/2021

Distributed Machine Learning and the Semblance of Trust

The utilisation of large and diverse datasets for machine learning (ML) ...
research
09/22/2020

Privacy Preserving K-Means Clustering: A Secure Multi-Party Computation Approach

Knowledge discovery is one of the main goals of Artificial Intelligence....

Please sign up or login with your details

Forgot password? Click here to reset