Limitations of ROC on Imbalanced Data: Evaluation of LVAD Mortality Risk Scores

10/29/2020
by   Faezeh Movahedi, et al.
9

Objective: This study illustrates the ambiguity of ROC in evaluating two classifiers of 90-day LVAD mortality. This paper also introduces the precision recall curve (PRC) as a supplemental metric that is more representative of LVAD classifiers performance in predicting the minority class. Background: In the LVAD domain, the receiver operating characteristic (ROC) is a commonly applied metric of performance of classifiers. However, ROC can provide a distorted view of classifiers ability to predict short-term mortality due to the overwhelmingly greater proportion of patients who survive, i.e. imbalanced data. Methods: This study compared the ROC and PRC for the outcome of two classifiers for 90-day LVAD mortality for 800 patients (test group) recorded in INTERMACS who received a continuous-flow LVAD between 2006 and 2016 (mean age of 59 years; 146 females vs. 654 males) in which mortality rate is only 90-day (imbalanced data). The two classifiers were HeartMate Risk Score (HMRS) and a Random Forest (RF). Results: The ROC indicates fairly good performance of RF and HRMS classifiers with Area Under Curves (AUC) of 0.77 vs. 0.63, respectively. This is in contrast with their PRC with AUC of 0.43 vs. 0.16 for RF and HRMS, respectively. The PRC for HRMS showed the precision rapidly dropped to only 10 with slightly increasing sensitivity. Conclusion: The ROC can portray an overly-optimistic performance of a classifier or risk score when applied to imbalanced data. The PRC provides better insight about the performance of a classifier by focusing on the minority class.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 7

research
04/30/2022

Electrocardiographic Deep Learning for Predicting Post-Procedural Mortality

Background. Pre-operative risk assessments used in clinical practice are...
research
03/13/2019

Predicting class-imbalanced business risk using resampling, regularization, and model ensembling algorithms

We aim at developing and improving the imbalanced business risk modeling...
research
12/15/2020

On the Importance of Diversity in Re-Sampling for Imbalanced Data and Rare Events in Mortality Risk Models

Surgical risk increases significantly when patients present with comorbi...
research
12/08/2019

Feature Engineering Combined with 1 D Convolutional Neural Network for Improved Mortality Prediction

The intensive care units (ICUs) are responsible for generating a wealth ...
research
02/09/2020

A Physiology-Driven Computational Model for Post-Cardiac Arrest Outcome Prediction

Patients resuscitated from cardiac arrest (CA) face a high risk of neuro...
research
09/04/2023

Survival Prediction from Imbalance colorectal cancer dataset using hybrid sampling methods and tree-based classifiers

Background and Objective: Colorectal cancer is a high mortality cancer. ...

Please sign up or login with your details

Forgot password? Click here to reset