Deep Stable Representation Learning on Electronic Health Records

09/03/2022
by Yingtao Luo, et al.

Deep learning models have achieved promising disease prediction performance on the Electronic Health Records (EHR) of patients. However, most models are developed under the I.I.D. hypothesis and do not account for agnostic distribution shifts, which diminishes their ability to generalize to Out-Of-Distribution (OOD) data. In this setting, models exploit spurious statistical correlations that may change across environments, which can lead to sub-optimal performance. In particular, the unstable correlation between procedures and diagnoses that exists in the training distribution can cause a spurious correlation between historical EHR and future diagnoses. To address this problem, we propose a causal representation learning method called Causal Healthcare Embedding (CHE). CHE aims to eliminate the spurious statistical relationship by removing the dependencies between diagnoses and procedures. We introduce the Hilbert-Schmidt Independence Criterion (HSIC) to measure the degree of independence between the embedded diagnosis and procedure features. Based on causal view analyses, we apply a sample weighting technique to remove such spurious relationships, enabling stable learning on EHR across different environments. Moreover, CHE can be used as a flexible plug-and-play module that enhances existing deep learning models on EHR. Extensive experiments on two public datasets and five state-of-the-art baselines show that CHE improves the prediction accuracy of deep learning models on out-of-distribution data by a large margin. In addition, the interpretability study shows that CHE successfully leverages causal structures to reflect a more reasonable contribution of historical records to predictions.
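
To make the role of HSIC concrete, the following is a minimal sketch of the standard biased empirical HSIC estimator, trace(KHLH) / (n - 1)^2, with Gaussian kernels. It is not the authors' implementation: the function names `rbf_kernel` and `hsic`, the kernel bandwidths, and the synthetic "diagnosis" and "procedure" embeddings are all assumptions chosen for illustration, and the sample weighting step of CHE is not shown.

```python
import numpy as np


def rbf_kernel(x, sigma=1.0):
    """Gaussian (RBF) kernel matrix for the rows of x (hypothetical helper)."""
    sq = np.sum(x ** 2, axis=1, keepdims=True)
    d2 = np.maximum(sq + sq.T - 2.0 * x @ x.T, 0.0)  # squared pairwise distances
    return np.exp(-d2 / (2.0 * sigma ** 2))


def hsic(x, y, sigma_x=1.0, sigma_y=1.0):
    """Biased empirical HSIC between paired samples x and y.

    Larger values indicate stronger statistical dependence; values near
    zero are consistent with independence.
    """
    n = x.shape[0]
    K = rbf_kernel(x, sigma_x)
    L = rbf_kernel(y, sigma_y)
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2


# Synthetic stand-ins for embedded diagnosis and procedure features (assumed data).
rng = np.random.default_rng(0)
diag = rng.normal(size=(200, 4))
proc_dep = diag + 0.1 * rng.normal(size=(200, 4))  # strongly dependent on diag
proc_ind = rng.normal(size=(200, 4))               # drawn independently of diag

hsic_dep = hsic(diag, proc_dep)
hsic_ind = hsic(diag, proc_ind)
print(hsic_dep > hsic_ind)
```

In this toy setup the dependent pair scores higher than the independent pair. A reweighting scheme in the spirit of the abstract would aim to drive such a dependence measure toward zero on the training sample.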

research
07/14/2019

Modeling the Uncertainty in Electronic Health Records: a Bayesian Deep Learning Approach

Deep learning models have exhibited superior performance in predictive t...
research
08/08/2021

Unifying Heterogenous Electronic Health Records Systems via Text-Based Code Embedding

Substantial increase in the use of Electronic Health Records (EHRs) has ...
research
07/22/2019

BEHRT: Transformer for Electronic Health Records

Today, despite decades of developments in medicine and the growing inter...
research
11/20/2021

Generalizing Graph Neural Networks on Out-Of-Distribution Graphs

Graph Neural Networks (GNNs) are proposed without considering the agnost...
research
11/20/2022

Instability in clinical risk stratification models using deep learning

While it has been well known in the ML community that deep learning mode...
research
10/06/2020

Deep Representation Learning of Patient Data from Electronic Health Records (EHR): A Systematic Review

Patient representation learning refers to learning a dense mathematical ...
research
04/16/2021

Deep Stable Learning for Out-Of-Distribution Generalization

Approaches based on deep neural networks have achieved striking performa...
