Calibration of Machine Learning Classifiers for Probability of Default Modelling

10/24/2017
by   Pedro G. Fonseca, et al.
0

Binary classification is highly used in credit scoring in the estimation of probability of default. The validation of such predictive models is based both on rank ability, and also on calibration (i.e. how accurately the probabilities output by the model map to the observed probabilities). In this study we cover the current best practices regarding calibration for binary classification, and explore how different approaches yield different results on real world credit scoring data. The limitations of evaluating credit scoring models using only rank ability metrics are explored. A benchmark is run on 18 real world datasets, and results compared. The calibration techniques used are Platt Scaling and Isotonic Regression. Also, different machine learning models are used: Logistic Regression, Random Forest Classifiers, and Gradient Boosting Classifiers. Results show that when the dataset is treated as a time series, the use of re-calibration with Isotonic Regression is able to improve the long term calibration better than the alternative methods. Using re-calibration, the non-parametric models are able to outperform the Logistic Regression on Brier Score Loss.

READ FULL TEXT

page 13

page 14

research
07/15/2021

Credit scoring using neural networks and SURE posterior probability calibration

In this article we compare the performances of a logistic regression and...
research
06/18/2012

Predicting accurate probabilities with a ranking loss

In many real-world applications of machine learning classifiers, it is e...
research
02/28/2020

UKARA 1.0 Challenge Track 1: Automatic Short-Answer Scoring in Bahasa Indonesia

We describe our third-place solution to the UKARA 1.0 challenge on autom...
research
02/09/2021

Classifier Calibration: with implications to threat scores in cybersecurity

This paper explores the calibration of a classifier output score in bina...
research
11/16/2015

Binary Classifier Calibration using an Ensemble of Near Isotonic Regression Models

Learning accurate probabilistic models from data is crucial in many prac...
research
01/13/2014

Binary Classifier Calibration: Bayesian Non-Parametric Approach

A set of probabilistic predictions is well calibrated if the events that...
research
09/18/2018

Actionable Recourse in Linear Classification

Classification models are often used to make decisions that affect human...

Please sign up or login with your details

Forgot password? Click here to reset