A Novel Data-Driven Framework for Risk Characterization and Prediction from Electronic Medical Records: A Case Study of Renal Failure

11/29/2017
by Prithwish Chakraborty, et al.

Electronic medical records (EMR) contain longitudinal information about patients that can be used to analyze outcomes. Typically, studies on EMR data have worked with established variables that are already known to be associated with certain outcomes. However, EMR data may also contain hitherto unrecognized factors for risk association and prediction of outcomes for a disease. In this paper, we present a scalable data-driven framework that analyzes an EMR data corpus in a disease-agnostic way and systematically uncovers important factors influencing patient outcomes, as supported by the data and without expert guidance. We validate the importance of such factors by using the framework to predict the relevant outcomes. Specifically, we analyze EMR data covering approximately 47 million unique patients to characterize renal failure (RF) among type 2 diabetic (T2DM) patients. We propose a specialized L1-regularized Cox Proportional Hazards (CoxPH) survival model to identify the important factors among those available in the patient encounter history. To validate the identified factors, we use a specialized generalized linear model (GLM) to predict the probability of renal failure for individual patients within a specified time window. Our experiments indicate that the factors identified via our data-driven method overlap with the patient characteristics recognized by experts. Our approach allows for scalable, repeatable and efficient utilization of data available in EMRs, confirms prior medical knowledge, and can generate new hypotheses without expert supervision.
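
To make the factor-selection step concrete, the following is a minimal sketch of a generic L1-regularized CoxPH fit, not the specialized model proposed in the paper. It minimizes the average negative log partial likelihood plus an L1 penalty by proximal gradient descent (soft-thresholding). The function names, the toy data, the step size, and the penalty value are all assumptions made for illustration, and the partial likelihood assumes no tied event times.

```python
import math

def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0

def cox_gradient(beta, X, times, events):
    """Gradient of the average negative log partial likelihood.

    Assumes distinct event times (no tie correction).
    """
    n, p = len(X), len(beta)
    # Relative hazard exp(x . beta) for every patient.
    scores = [math.exp(sum(b * x for b, x in zip(beta, row))) for row in X]
    grad = [0.0] * p
    for i in range(n):
        if not events[i]:
            continue  # censored patients only appear in risk sets
        # Risk set: everyone still under observation at time t_i.
        risk = [j for j in range(n) if times[j] >= times[i]]
        denom = sum(scores[j] for j in risk)
        for k in range(p):
            weighted_mean = sum(scores[j] * X[j][k] for j in risk) / denom
            grad[k] -= X[i][k] - weighted_mean
    return [g / n for g in grad]

def fit_l1_cox(X, times, events, penalty, step=0.1, iterations=500):
    """Proximal gradient descent for the L1-penalized Cox objective."""
    beta = [0.0] * len(X[0])
    for _ in range(iterations):
        grad = cox_gradient(beta, X, times, events)
        beta = [soft_threshold(b - step * g, step * penalty)
                for b, g in zip(beta, grad)]
    return beta

# Hypothetical toy cohort: column 0 is a binary factor carried by the
# patients who fail earliest; column 1 is unrelated to the outcome.
X = [[1, 0], [1, 1], [1, 1], [1, 0], [0, 1], [0, 0], [0, 0], [0, 1]]
times = [1, 2, 3, 4, 5, 6, 7, 8]    # follow-up time per patient
events = [1, 1, 1, 1, 1, 1, 1, 0]   # 1 = event observed, 0 = censored

beta = fit_l1_cox(X, times, events, penalty=0.05)
selected = [k for k, b in enumerate(beta) if b != 0.0]
print("coefficients:", beta)
print("selected factor indices:", selected)
```

Factors whose coefficients are driven exactly to zero are dropped, which is what makes the L1 penalty usable as a selection mechanism; a larger penalty removes more factors. At the scale described in the abstract, a dedicated survival-analysis library would be a more practical choice than this pure-Python loop.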


research
04/25/2019

Predicting Stroke from Electronic Health Records

Studies have identified various risk factors associated with the onset o...
research
11/18/2019

Predicting colorectal polyp recurrence using time-to-event analysis of medical records

Identifying patient characteristics that influence the rate of colorecta...
research
07/18/2019

Application of Cox Model to predict the survival of patients with Chronic Heart Failure: A latent class regression approach

Most prediction models that are used in medical research fail to accurat...
research
02/19/2018

Simultaneous Modeling of Multiple Complications for Risk Profiling in Diabetes Care

Type 2 diabetes mellitus (T2DM) is a chronic disease that often results ...
research
03/01/2022

A predictive analytics approach for stroke prediction using machine learning and neural networks

The negative impact of stroke in society has led to concerted efforts to...
research
11/03/2020

Sanguine: Visual Analysis for Patient Blood Management

Blood transfusion is a frequently performed medical procedure in surgica...
research
04/03/2018

Hospital Readmission Prediction - Applying Hierarchical Sparsity Norms for Interpretable Models

Hospital readmissions have become one of the key measures of healthcare ...
