A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data

04/08/2014
by   Julian Wolfson, et al.
0

Predicting an individual's risk of experiencing a future clinical outcome is a statistical task with important consequences for both practicing clinicians and public health experts. Modern observational databases such as electronic health records (EHRs) provide an alternative to the longitudinal cohort studies traditionally used to construct risk models, bringing with them both opportunities and challenges. Large sample sizes and detailed covariate histories enable the use of sophisticated machine learning techniques to uncover complex associations and interactions, but observational databases are often "messy," with high levels of missing data and incomplete patient follow-up. In this paper, we propose an adaptation of the well-known Naive Bayes (NB) machine learning approach for classification to time-to-event outcomes subject to censoring. We compare the predictive performance of our method to the Cox proportional hazards model which is commonly used for risk prediction in healthcare populations, and illustrate its application to prediction of cardiovascular risk using an EHR dataset from a large Midwest integrated healthcare system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2018

Measuring the Stability of EHR- and EKG-based Predictive Models

Databases of electronic health records (EHRs) are increasingly used to i...
research
04/12/2022

Hybrid Feature- and Similarity-Based Models for Prediction and Interpretation using Large-Scale Observational Data

Introduction: Large-scale electronic health record(EHR) datasets often i...
research
05/08/2023

Large-Scale Study of Temporal Shift in Health Insurance Claims

Most machine learning models for predicting clinical outcomes are develo...
research
06/09/2023

Transformer-based Time-to-Event Prediction for Chronic Kidney Disease Deterioration

Deep-learning techniques, particularly the transformer model, have shown...
research
11/12/2019

Harmonic Mean Point Processes: Proportional Rate Error Minimization for Obtundation Prediction

In healthcare, the highest risk individuals for morbidity and mortality ...
research
05/09/2022

Methodology to Create Analysis-Naive Holdout Records as well as Train and Test Records for Machine Learning Analyses in Healthcare

It is common for researchers to holdout data from a study pool to be use...

Please sign up or login with your details

Forgot password? Click here to reset