Efficient Commercial Bank Customer Credit Risk Assessment Based on LightGBM and Feature Engineering

08/17/2023
by Yanjie Sun, et al.

Effective control of credit risk is central to the stable operation of commercial banks. Using a customer information dataset from a foreign commercial bank, published on Kaggle, we build a LightGBM classifier that estimates the likelihood of customer credit default. The paper focuses on feature engineering, including missing-value handling, categorical encoding, and treatment of imbalanced samples, which greatly improves model performance. Its main contribution is the construction of new features from the original dataset, with which the classifier reaches an accuracy of 0.734 and an AUC of 0.772, higher than many classifiers built on the same dataset. The model can serve as a reference for commercial banks' credit-granting decisions and offers feature-processing ideas for similar studies.
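The abstract names three preprocessing steps (missing-value handling, encoding, imbalanced samples) without stating which technique was used for each. As a minimal sketch only, the example below assumes median imputation, one-hot encoding, and a negative-to-positive class weight; these are common choices, not necessarily the authors' method. The toy records, the column names (`income`, `contract`, `default`), and all function names are hypothetical and do not come from the paper's dataset.

```python
from statistics import median

# Hypothetical toy records; column names are illustrative only.
rows = [
    {"income": 52000.0, "contract": "cash", "default": 0},
    {"income": None, "contract": "revolving", "default": 0},
    {"income": 31000.0, "contract": "cash", "default": 1},
    {"income": 78000.0, "contract": "cash", "default": 0},
]

def impute_median(rows, column):
    """Replace missing (None) values in a numeric column with the column median."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill = median(observed)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]

def one_hot(rows, column):
    """Expand a categorical column into one 0/1 indicator column per category."""
    categories = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new = {k: v for k, v in r.items() if k != column}
        for c in categories:
            new[f"{column}_{c}"] = 1 if r[column] == c else 0
        encoded.append(new)
    return encoded

def positive_class_weight(rows, label):
    """Ratio of negatives to positives, a common weight for the minority class."""
    positives = sum(1 for r in rows if r[label] == 1)
    negatives = len(rows) - positives
    return negatives / positives

# Assumed order: impute first, then encode, then derive the class weight.
prepared = one_hot(impute_median(rows, "income"), "contract")
weight = positive_class_weight(prepared, "default")
```

A ratio like `weight` is the kind of value that could be supplied to LightGBM's `scale_pos_weight` parameter for a binary objective, though the abstract does not say whether the authors reweighted or resampled.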
