Bringing the Algorithms to the Data – Secure Distributed Medical Analytics using the Personal Health Train (PHT-meDIC)

The need for data privacy and security, enforced through increasingly strict data protection regulations, makes it difficult to use healthcare data for machine learning. In particular, transferring data between hospitals is often not permissible, so cross-site pooling of data is not an option. The Personal Health Train (PHT) paradigm proposed within the GO-FAIR initiative follows an 'algorithm to the data' approach, which ensures that distributed data can be accessed for analysis without transferring any sensitive data. We present PHT-meDIC, an open-source implementation of the PHT concept that is deployed in production. Containerization allows us to easily deploy even complex data analysis pipelines (e.g., genomics, image analysis) across multiple sites in a secure and scalable manner. We discuss the underlying technological concepts, security models, and governance processes. The implementation has been successfully applied to distributed analyses of large-scale data, including applications of deep neural networks to medical image data.

