Feature quantization for parsimonious and interpretable predictive models

03/21/2019
by Adrien Ehrhardt, et al.

For regulatory and interpretability reasons, logistic regression is still widely used. To improve prediction accuracy and interpretability, a preprocessing step that quantizes both continuous and categorical data is usually performed: continuous features are discretized and, when a categorical feature has many levels, those levels are grouped. Even better predictive accuracy can be reached by embedding the estimation of this quantization directly into the estimation of the predictive model itself. Doing so, however, requires optimizing the predictive loss over a huge set of candidate quantizations. To overcome this difficulty, we introduce a specific two-step optimization strategy: first, the optimization problem is relaxed by approximating the discontinuous quantization functions with smooth functions; second, the resulting relaxed optimization problem is solved via a particular neural network. The good performance of this approach, which we call glmdisc, is illustrated on simulated and real data from the UCI library and Crédit Agricole Consumer Finance (a major European historic player in the consumer credit market).
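
To make the relaxation step concrete, here is a minimal sketch of how a discontinuous quantization of one continuous feature might be approximated by a smooth function. The softmax form, the function names, the cut point, and the parameter values are all illustrative assumptions; the abstract does not specify the smooth functions glmdisc actually uses.

```python
import numpy as np

def hard_quantize(x, cuts):
    """One-hot encode x into len(cuts) + 1 intervals (discontinuous in x)."""
    bins = np.searchsorted(cuts, x, side="right")
    return np.eye(len(cuts) + 1)[bins]

def soft_quantize(x, intercepts, slopes):
    """Assumed smooth surrogate: a softmax over one affine score per level."""
    scores = intercepts[None, :] + np.outer(x, slopes)
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    return weights / weights.sum(axis=1, keepdims=True)

# Hypothetical feature values and a single cut point at 0.
x = np.array([-2.0, -0.1, 0.1, 2.0])
hard = hard_quantize(x, cuts=np.array([0.0]))

# A steep slope makes the soft assignment approach the hard one.
soft = soft_quantize(x, intercepts=np.array([0.0, 0.0]),
                     slopes=np.array([0.0, 50.0]))
```

Because the soft assignment is differentiable in its intercepts and slopes, those parameters could in principle be fitted by gradient descent together with the logistic regression coefficients, and a hard quantization can be read back by taking the most probable level for each observation.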


research
10/03/2018

Relaxed Quantization for Discretized Neural Networks

Neural network quantization has become an important research area due to...
research
09/21/2022

Interpretable Selective Learning in Credit Risk

The forecasting of the credit default risk has been an important researc...
research
07/15/2021

Credit scoring using neural networks and SURE posterior probability calibration

In this article we compare the performances of a logistic regression and...
research
06/08/2020

Interpretable Signal Analysis with Knockoffs Enhances Classification of Bacterial Raman Spectra

Interpretability is important for many applications of machine learning ...
research
07/20/2016

Indebted households profiling: a knowledge discovery from database approach

A major challenge in consumer credit risk portfolio management is to cla...
research
05/07/2022

Accuracy Convergent Field Predictors

Several predictive algorithms are described. Highlighted are variants th...
research
01/05/2021

Weight-of-evidence 2.0 with shrinkage and spline-binning

In many practical applications, such as fraud detection, credit risk mod...
