A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort

01/10/2022
by   Yilin Ning, et al.
7

Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors, but such 'black box' variable selection limits interpretability, and variable importance evaluated from a single model can be biased. We propose a robust and interpretable variable selection approach using the recently developed Shapley variable importance cloud (ShapleyVIC) that accounts for variability across models. Our approach evaluates and visualizes overall variable contributions for in-depth inference and transparent variable selection, and filters out non-significant contributors to simplify model building steps. We derive an ensemble variable ranking from variable contributions, which is easily integrated with an automated and modularized risk score generator, AutoScore, for convenient implementation. In a study of early death or unplanned readmission, ShapleyVIC selected 6 of 41 candidate variables to create a well-performing model, which had similar performance to a 16-variable model from machine-learning-based ranking.

READ FULL TEXT

page 24

page 27

page 28

research
06/13/2021

AutoScore-Survival: Developing interpretable machine learning-based time-to-event scores with right-censored survival data

Scoring systems are highly interpretable and widely used to evaluate tim...
research
12/16/2022

Shapley variable importance cloud for machine learning models

Current practice in interpretable machine learning often focuses on expl...
research
09/17/2019

Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning

Dynamic assessment of patient status (e.g. by an automated, continuously...
research
04/22/2016

Developing an ICU scoring system with interaction terms using a genetic algorithm

ICU mortality scoring systems attempt to predict patient mortality using...
research
02/17/2022

AutoScore-Ordinal: An interpretable machine learning framework for generating scoring models for ordinal outcomes

Background: Risk prediction models are useful tools in clinical decision...
research
11/19/2020

Modelling fertility potential in survivors of childhood cancer: An introduction to modern statistical and computational methods

Statistical and computational methods are widely used in today's scienti...
research
07/13/2021

AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data

Background: Medical decision-making impacts both individual and public h...

Please sign up or login with your details

Forgot password? Click here to reset