Two-stage Federated Phenotyping and Patient Representation Learning

08/14/2019
by   Dianbo Liu, et al.
0

A large percentage of medical information is in unstructured text format in electronic medical record systems. Manual extraction of information from clinical notes is extremely time consuming. Natural language processing has been widely used in recent years for automatic information extraction from medical texts. However, algorithms trained on data from a single healthcare provider are not generalizable and error-prone due to the heterogeneity and uniqueness of medical documents. We develop a two-stage federated natural language processing method that enables utilization of clinical notes from different hospitals or clinics without moving the data, and demonstrate its performance using obesity and comorbities phenotyping as medical task. This approach not only improves the quality of a specific clinical task but also facilitates knowledge progression in the whole healthcare system, which is an essential part of learning health system. To the best of our knowledge, this is the first application of federated machine learning in clinical NLP.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2022

EHRKit: A Python Natural Language Processing Toolkit for Electronic Health Record Texts

The Electronic Health Record (EHR) is an essential part of the modern me...
research
02/20/2020

Federated pretraining and fine tuning of BERT using clinical notes from multiple silos

Large scale contextual representation models, such as BERT, have signifi...
research
07/02/2020

NLNDE: The Neither-Language-Nor-Domain-Experts' Way of Spanish Medical Document De-Identification

Natural language processing has huge potential in the medical domain whi...
research
09/18/2018

Lung Cancer Concept Annotation from Spanish Clinical Narratives

Recent rapid increase in the generation of clinical data and rapid devel...
research
03/31/2019

Surrogate-guided sampling designs for classification of rare outcomes from electronic medical records data

Scalable and accurate identification of specific clinical outcomes has b...
research
06/23/2020

A Deep Learning Pipeline for Patient Diagnosis Prediction Using Electronic Health Records

Augmentation of disease diagnosis and decision-making in healthcare with...

Please sign up or login with your details

Forgot password? Click here to reset