Prediction of motor insurance claims occurrence as an imbalanced machine learning problem

04/12/2022
by   Sebastian Baran, et al.
9

The insurance industry, with its large datasets, is a natural place to use big data solutions. However it must be stressed, that significant number of applications for machine learning in insurance industry, like fraud detection or claim prediction, deals with the problem of machine learning on an imbalanced data set. This is due to the fact that frauds or claims are rare events when compared with the entire population of drivers. The problem of imbalanced learning is often hard to overcome. Therefore, the main goal of this work is to present and apply various methods of dealing with an imbalanced dataset in the context of claim occurrence prediction in car insurance. In addition, the above techniques are used to compare the results of machine learning algorithms in the context of claim occurrence prediction in car insurance. Our study covers the following techniques: logistic-regression, decision tree, random forest, xgBoost, feed-forward network. The problem is the classification one.

READ FULL TEXT

page 7

page 8

page 10

page 11

research
04/06/2019

A Novel Big Data Analytics Framework to Predict the Risk of Opioid Use Disorder

Addiction and overdose related to prescription opioids have reached an e...
research
02/21/2023

Tree-Based Machine Learning Methods For Vehicle Insurance Claims Size Prediction

Vehicle insurance claims size prediction needs methods to efficiently ha...
research
01/23/2019

Predicting the Results of LTL Model Checking using Multiple Machine Learning Algorithms

In this paper, we study how to predict the results of LTL model checking...
research
04/30/2018

An Anti-fraud System for Car Insurance Claim Based on Visual Evidence

Automatically scene understanding using machine learning algorithms has ...
research
10/29/2019

Predicting Louisiana Public High School Dropout through Imbalanced Learning Techniques

This study is motivated by the magnitude of the problem of Louisiana hig...
research
05/21/2018

Predicting Electricity Outages Caused by Convective Storms

We consider the problem of predicting power outages in an electrical pow...
research
05/28/2021

How much telematics information do insurers need for claim classification?

It has been shown several times in the literature that telematics data c...

Please sign up or login with your details

Forgot password? Click here to reset