Medical Concept Representation Learning from Electronic Health Records and its Application on Heart Failure Prediction

02/11/2016
by   Edward Choi, et al.
0

Objective: To transform heterogeneous clinical data from electronic health records into clinically meaningful constructed features using data driven method that rely, in part, on temporal relations among data. Materials and Methods: The clinically meaningful representations of medical concepts and patients are the key for health analytic applications. Most of existing approaches directly construct features mapped to raw data (e.g., ICD or CPT codes), or utilize some ontology mapping such as SNOMED codes. However, none of the existing approaches leverage EHR data directly for learning such concept representation. We propose a new way to represent heterogeneous medical concepts (e.g., diagnoses, medications and procedures) based on co-occurrence patterns in longitudinal electronic health records. The intuition behind the method is to map medical concepts that are co-occuring closely in time to similar concept vectors so that their distance will be small. We also derive a simple method to construct patient vectors from the related medical concept vectors. Results: For qualitative evaluation, we study similar medical concepts across diagnosis, medication and procedure. In quantitative evaluation, our proposed representation significantly improves the predictive modeling performance for onset of heart failure (HF), where classification methods (e.g. logistic regression, neural network, support vector machine and K-nearest neighbors) achieve up to 23 using this proposed representation. Conclusion: We proposed an effective method for patient and medical concept representation learning. The resulting representation can map relevant concepts together and also improves predictive modeling performance.

READ FULL TEXT
research
07/19/2019

Snomed2Vec: Random Walk and Poincaré Embeddings of a Clinical Knowledge Base for Healthcare Analytics

Representation learning methods that transform encoded data (e.g., diagn...
research
04/27/2020

Knowledge Base Completion for Constructing Problem-Oriented Medical Records

Both electronic health records and personal health records are typically...
research
10/15/2015

A Method for Modeling Co-Occurrence Propensity of Clinical Codes with Application to ICD-10-PCS Auto-Coding

Objective. Natural language processing methods for medical auto-coding, ...
research
11/13/2019

TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records

Phenotyping electronic health records (EHR) focuses on defining meaningf...
research
05/19/2023

LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic Health Records

Electronic health record (EHR) data are increasingly used to support rea...
research
10/30/2020

Biomedical Concept Relatedness – A large EHR-based benchmark

A promising application of AI to healthcare is the retrieval of informat...

Please sign up or login with your details

Forgot password? Click here to reset