FasterRisk: Fast and Accurate Interpretable Risk Scores

10/12/2022
by   Jiachang Liu, et al.
9

Over the last century, risk scores have been the most popular form of predictive model used in healthcare and criminal justice. Risk scores are sparse linear models with integer coefficients; often these models can be memorized or placed on an index card. Typically, risk scores have been created either without data or by rounding logistic regression coefficients, but these methods do not reliably produce high-quality risk scores. Recent work used mathematical programming, which is computationally slow. We introduce an approach for efficiently producing a collection of high-quality risk scores learned from data. Specifically, our approach produces a pool of almost-optimal sparse continuous solutions, each with a different support set, using a beam-search algorithm. Each of these continuous solutions is transformed into a separate risk score through a "star ray" search, where a range of multipliers are considered before rounding the coefficients sequentially to maintain low logistic loss. Our algorithm returns all of these high-quality risk scores for the user to consider. This method completes within minutes and can be valuable in a broad variety of applications.

READ FULL TEXT

page 9

page 40

research
10/01/2016

Learning Optimized Risk Scores on Large-Scale Datasets

Risk scores are simple classification models that let users quickly asse...
research
02/23/2022

Fast Sparse Classification for Generalized Linear and Additive Models

We present fast classification techniques for sparse generalized linear ...
research
04/24/2023

Sparse Private LASSO Logistic Regression

LASSO regularized logistic regression is particularly useful for its bui...
research
09/21/2022

Interpretable Selective Learning in Credit Risk

The forecasting of the credit default risk has been an important researc...
research
09/21/2016

Network-regularized Sparse Logistic Regression Models for Clinical Risk Prediction and Biomarker Discovery

Molecular profiling data (e.g., gene expression) has been used for clini...
research
02/07/2018

Directly and Efficiently Optimizing Prediction Error and AUC of Linear Classifiers

The predictive quality of machine learning models is typically measured ...
research
06/27/2019

Simultaneous Transformation and Rounding (STAR) Models for Integer-Valued Data

We propose a simple yet powerful framework for modeling integer-valued d...

Please sign up or login with your details

Forgot password? Click here to reset