Weight-of-evidence 2.0 with shrinkage and spline-binning

01/05/2021
by   Jakob Raymaekers, et al.
0

In many practical applications, such as fraud detection, credit risk modeling or medical decision making, classification models for assigning instances to a predefined set of classes are required to be both precise as well as interpretable. Linear modeling methods such as logistic regression are often adopted, since they offer an acceptable balance between precision and interpretability. Linear methods, however, are not well equipped to handle categorical predictors with high-cardinality or to exploit non-linear relations in the data. As a solution, data preprocessing methods such as weight-of-evidence are typically used for transforming the predictors. The binning procedure that underlies the weight-of-evidence approach, however, has been little researched and typically relies on ad-hoc or expert driven procedures. The objective in this paper, therefore, is to propose a formalized, data-driven and powerful method. To this end, we explore the discretization of continuous variables through the binning of spline functions, which allows for capturing non-linear effects in the predictor variables and yields highly interpretable predictors taking only a small number of discrete values. Moreover, we extend upon the weight-of-evidence approach and propose to estimate the proportions using shrinkage estimators. Together, this offers an improved ability to exploit both non-linear and categorical predictors for achieving increased classification precision, while maintaining interpretability of the resulting model and decreasing the risk of overfitting. We present the results of a series of experiments in a fraud detection setting, which illustrate the effectiveness of the presented approach. We facilitate reproduction of the presented results and adoption of the proposed approaches by providing both the dataset and the code for implementing the experiments and the presented approach.

READ FULL TEXT

page 10

page 13

research
09/01/2023

Optimal Scaling transformations to model non-linear relations in GLMs with ordered and unordered predictors

In Generalized Linear Models (GLMs) it is assumed that there is a linear...
research
10/19/2021

On Clustering Categories of Categorical Predictors in Generalized Linear Models

We propose a method to reduce the complexity of Generalized Linear Model...
research
12/30/2022

Polynomial spline regression: Theory and Application

To deal with non-linear relations between the predictors and the respons...
research
10/14/2022

Variable Importance Based Interaction Modeling with an Application on Initial Spread of COVID-19 in China

Interaction selection for linear regression models with both continuous ...
research
11/28/2021

Multicriteria interpretability driven Deep Learning

Deep Learning methods are renowned for their performances, yet their lac...
research
08/29/2022

Multiresolution categorical regression for interpretable cell type annotation

In many categorical response regression applications, the response categ...
research
03/21/2019

Feature quantization for parsimonious and interpretable predictive models

For regulatory and interpretability reasons, logistic regression is stil...

Please sign up or login with your details

Forgot password? Click here to reset