Influence of the Event Rate on Discrimination Abilities of Bankruptcy Prediction Models

03/10/2018
by   Lili Zhang, et al.
0

In bankruptcy prediction, the proportion of events is very low, which is often oversampled to eliminate this bias. In this paper, we study the influence of the event rate on discrimination abilities of bankruptcy prediction models. First the statistical association and significance of public records and firmographics indicators with the bankruptcy were explored. Then the event rate was oversampled from 0.12 models were developed, including Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, Support Vector Machine, Bayesian Network, and Neural Network. Under different event rates, models were comprehensively evaluated and compared based on Kolmogorov-Smirnov Statistic, accuracy, F1 score, Type I error, Type II error, and ROC curve on the hold-out dataset with their best probability cut-offs. Results show that Bayesian Network is the most insensitive to the event rate, while Support Vector Machine is the most sensitive.

READ FULL TEXT
research
12/12/2020

Yelp Review Rating Prediction: Machine Learning and Deep Learning Models

We predict restaurant ratings from Yelp reviews based on Yelp Open Datas...
research
02/02/2023

Analysis of Biomass Sustainability Indicators from a Machine Learning Perspective

Plant biomass estimation is critical due to the variability of different...
research
10/30/2019

A Classifiers Voting Model for Exit Prediction of Privately Held Companies

Predicting the exit (e.g. bankrupt, acquisition, etc.) of privately held...
research
09/21/2022

Leak Detection in Natural Gas Pipeline Using Machine Learning Models

Leak detection in gas pipelines is an important and persistent problem i...
research
08/06/2021

A Deep Neural Network Approach for Crop Selection and Yield Prediction in Bangladesh

Agriculture is the essential ingredients to mankind which is a major sou...
research
06/13/2021

SASICM A Multi-Task Benchmark For Subtext Recognition

Subtext is a kind of deep semantics which can be acquired after one or m...
research
08/16/2022

Ex-Ante Assessment of Discrimination in Dataset

Data owners face increasing liability for how the use of their data coul...

Please sign up or login with your details

Forgot password? Click here to reset