Knowledge Graph Embedding with Electronic Health Records Data via Latent Graphical Block Model

05/31/2023
by Junwei Lu, et al.

With the increasing adoption of electronic health records (EHRs), large-scale EHR data have become another rich data source for translational clinical research. Despite this potential, deriving generalizable knowledge from EHR data remains challenging. First, EHR data are generated as part of clinical care, with data elements too detailed and fragmented for research. Despite recent progress in mapping EHR data to common ontologies with hierarchical structures, much development is still needed to enable automatic grouping of local EHR codes into meaningful clinical concepts at a large scale. Second, the total number of unique EHR features is large, which poses methodological challenges for deriving a reproducible knowledge graph, especially when interest lies in the conditional dependency structure. Third, detailed EHR data on a very large patient cohort pose an additional computational challenge for deriving a knowledge network. To overcome these challenges, we propose to infer the conditional dependency structure among EHR features via a latent graphical block model (LGBM). The LGBM has a two-layer structure: the first layer provides a semantic embedding vector (SEV) representation for the EHR features, and the second overlays a graphical block model on the latent SEVs. The block structure of the graphical model also allows us to cluster synonymous features in EHR data. We propose to learn the LGBM efficiently, in both the statistical and the computational sense, based on the empirical pointwise mutual information matrix. We establish the statistical rates of the proposed estimators and show perfect recovery of the block structure. Numerical results from simulation studies and real EHR data analyses suggest that the proposed LGBM estimator performs well in finite samples.
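
The abstract names the empirical pointwise mutual information (PMI) matrix as the input to estimation but does not spell out how it is built. The following is a minimal sketch of the textbook PMI definition applied to a toy code co-occurrence table, not the authors' estimator. The function name `empirical_pmi`, the toy counts, and the choice to leave zero-count pairs at 0 are all assumptions made for illustration.

```python
import math


def empirical_pmi(cooc):
    """Return the PMI matrix for a square matrix of co-occurrence counts.

    PMI(i, j) = log( C_ij * N / (C_i * C_j) ), where C_i is the row total
    for feature i and N is the grand total of all counts.
    Assumption: pairs that never co-occur are left at 0.0 instead of -inf.
    """
    n = len(cooc)
    row_totals = [sum(row) for row in cooc]
    total = sum(row_totals)
    pmi = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            count = cooc[i][j]
            if count > 0:
                pmi[i][j] = math.log(
                    count * total / (row_totals[i] * row_totals[j])
                )
    return pmi


# Hypothetical counts for four EHR codes: codes 0 and 1 co-occur often,
# as do codes 2 and 3, while cross-pair co-occurrence is rare.
cooc = [
    [0, 10, 1, 1],
    [10, 0, 1, 1],
    [1, 1, 0, 10],
    [1, 1, 10, 0],
]
pmi = empirical_pmi(cooc)
```

In this toy table the within-pair entries such as `pmi[0][1]` come out positive and the cross-pair entries such as `pmi[0][2]` come out negative, which is the kind of signal a block-structured model could use to group synonymous codes.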


