Reconstructing Missing EHRs Using Time-Aware Within- and Cross-Visit Information for Septic Shock Early Prediction

03/15/2022
by   Ge Gao, et al.
0

Real-world Electronic Health Records (EHRs) are often plagued by a high rate of missing data. In our EHRs, for example, the missing rates can be as high as 90 features. We propose a Time-Aware Dual-Cross-Visit missing value imputation method, named TA-DualCV, which spontaneously leverages multivariate dependencies across features and longitudinal dependencies both within- and cross-visit to maximize the information extracted from limited observable records in EHRs. Specifically, TA-DualCV captures the latent structure of missing patterns across measurements of different features and it also considers the time continuity and capture the latent temporal missing patterns based on both time-steps and irregular time-intervals. TA-DualCV is evaluated using three large real-world EHRs on two types of tasks: an unsupervised imputation task by varying mask rates up to 90 prediction of septic shock using Long Short-Term Memory (LSTM). Our results show that TA-DualCV performs significantly better than all of the existing state-of-the-art imputation baselines, such as DETROIT and TAME, on both types of tasks.

READ FULL TEXT
research
03/31/2023

A robust deep learning-based damage identification approach for SHM considering missing data

Data-driven method for Structural Health Monitoring (SHM), that mine the...
research
12/02/2018

Imputation of Clinical Covariates in Time Series

Missing data is a common problem in real-world settings and particularly...
research
08/19/2023

Contrastive Learning-based Imputation-Prediction Networks for In-hospital Mortality Risk Modeling using EHRs

Predicting the risk of in-hospital mortality from electronic health reco...
research
07/05/2021

Imputation-Free Learning from Incomplete Observations

Although recent works have developed methods that can generate estimatio...
research
08/15/2018

Development and Evaluation of Recurrent Neural Network based Models for Hourly Traffic Volume and AADT Prediction

The prediction of high-resolution hourly traffic volumes of a given road...
research
03/17/2019

Training recurrent neural networks robust to incomplete data: application to Alzheimer's disease progression modeling

Disease progression modeling (DPM) using longitudinal data is a challeng...
research
08/02/2022

Compound Density Networks for Risk Prediction using Electronic Health Records

Electronic Health Records (EHRs) exhibit a high amount of missing data d...

Please sign up or login with your details

Forgot password? Click here to reset