Federated Learning of Medical Concepts Embedding using BEHRT

05/22/2023
by   Ofir Ben Shoham, et al.
0

Electronic Health Records (EHR) data contains medical records such as diagnoses, medications, procedures, and treatments of patients. This data is often considered sensitive medical information. Therefore, the EHR data from the medical centers often cannot be shared, making it difficult to create prediction models using multi-center EHR data, which is essential for such models' robustness and generalizability. Federated Learning (FL) is an algorithmic approach that allows learning a shared model using data in multiple locations without the need to store all data in a central place. An example of a prediction model's task is to predict future diseases. More specifically, the model needs to predict patient's next visit diagnoses, based on current and previous clinical data. Such a prediction model can support care providers in making clinical decisions and even provide preventive treatment. We propose a federated learning approach for learning medical concepts embedding. This pre-trained model can be used for fine-tuning for specific downstream tasks. Our approach is based on an embedding model like BEHRT, a deep neural sequence transduction model for EHR. We train using federated learning, both the Masked Language Modeling (MLM) and the next visit downstream model. We demonstrate our approach on the MIMIC-IV dataset. We compare the performance of a model trained with FL against a model trained on centralized data. We find that our federated learning approach reaches very close to the performance of a centralized model, and it outperforms local models in terms of average precision. We also show that pre-trained MLM improves the model's average precision performance in the next visit prediction task, compared to an MLM model without pre-training. Our code is available at https://github.com/nadavlab/FederatedBEHRT.

READ FULL TEXT

page 6

page 7

research
05/16/2019

BrainTorrent: A Peer-to-Peer Environment for Decentralized Federated Learning

Access to sufficient annotated data is a common challenge in training de...
research
07/24/2020

Impact of Medical Data Imprecision on Learning Results

Test data measured by medical instruments often carry imprecise ranges t...
research
04/11/2022

CXR-FL: Deep Learning-based Chest X-ray Image Analysis Using Federated Learning

Federated learning enables building a shared model from multicentre data...
research
06/23/2022

On Pre-Training for Federated Learning

In most of the literature on federated learning (FL), neural networks ar...
research
09/06/2021

Generation of Synthetic Electronic Health Records Using a Federated GAN

Sensitive medical data is often subject to strict usage constraints. In ...
research
06/06/2023

Guiding The Last Layer in Federated Learning with Pre-Trained Models

Federated Learning (FL) is an emerging paradigm that allows a model to b...
research
10/27/2019

Federated Uncertainty-Aware Learning for Distributed Hospital EHR Data

Recent works have shown that applying Machine Learning to Electronic Hea...

Please sign up or login with your details

Forgot password? Click here to reset