Unsupervised Annotation of Phenotypic Abnormalities via Semantic Latent Representations on Electronic Health Records

11/10/2019
by Jingqing Zhang, et al.

Phenotype information is naturally contained in electronic health records (EHRs), and extracting it has been found useful in clinical informatics applications such as disease diagnosis. However, annotating phenotypic abnormalities on millions of EHR narratives remains challenging because of imprecise descriptions, the lack of gold standards, and the demand for efficiency. In this work, we propose a novel unsupervised deep learning framework that annotates phenotypic abnormalities from EHRs via semantic latent representations. The proposed framework takes advantage of the Human Phenotype Ontology (HPO), a knowledge base of phenotypic abnormalities, to standardize the annotation results. Experiments were conducted on 52,722 EHRs from the MIMIC-III dataset. Quantitative and qualitative analyses show that the proposed framework achieves state-of-the-art annotation performance and computational efficiency compared with other methods.

