Interpretation of machine learning predictions for patient outcomes in electronic health records

03/14/2019
by   William La Cava, et al.
0

Electronic health records are an increasingly important resource for understanding the interactions between patient health, environment, and clinical decisions. In this paper we report an empirical study of predictive modeling of several patient outcomes using three state-of-the-art machine learning methods. Our primary goal is to validate the models by interpreting the importance of predictors in the final models. Central to interpretation is the use of feature importance scores, which vary depending on the underlying methodology. In order to assess feature importance, we compared univariate statistical tests, information-theoretic measures, permutation testing, and normalized coefficients from multivariate logistic regression models. In general we found poor correlation between methods in their assessment of feature importance, even when their performance is comparable and relatively good. However, permutation tests applied to random forest and gradient boosting models showed the most agreement, and the importance scores matched the clinical interpretation most frequently.

READ FULL TEXT

page 3

page 8

research
12/01/2018

Measuring the Stability of EHR- and EKG-based Predictive Models

Databases of electronic health records (EHRs) are increasingly used to i...
research
05/11/2021

A Bayesian Hierarchical Modeling Framework for Geospatial Analysis of Adverse Pregnancy Outcomes

Studying the determinants of adverse pregnancy outcomes like stillbirth ...
research
03/23/2021

On the global identifiability of logistic regression models with misclassified outcomes

In the last decade, the secondary use of large data from health systems,...
research
12/22/2022

Enhancing the prediction of disease outcomes using electronic health records and pretrained deep learning models

Question: Can an encoder-decoder architecture pretrained on a large data...
research
04/03/2019

Medical device surveillance with electronic health records

Post-market medical device surveillance is a challenge facing manufactur...
research
10/13/2019

Nonstationary Multivariate Gaussian Processes for Electronic Health Records

We propose multivariate nonstationary Gaussian processes for jointly mod...
research
07/06/2017

RIDDLE: Race and ethnicity Imputation from Disease history with Deep LEarning

Anonymized electronic medical records are an increasingly popular source...

Please sign up or login with your details

Forgot password? Click here to reset