Hybrid Tree-based Models for Insurance Claims

06/10/2020
by Zhiyu Quan, et al.

Two-part models and Tweedie generalized linear models (GLMs) have been used to model loss costs for short-term insurance contracts. Most portfolios of insurance claims contain a large proportion of zero claims, and the resulting imbalance degrades the prediction accuracy of these traditional approaches. This article proposes tree-based models with a hybrid structure, built by a two-step algorithm, as an alternative. The first step constructs a classification tree to build the probability model for frequency. The second step fits elastic net regression models at each terminal node of the classification tree to build the distribution model for severity. Because hyperparameters are tuned at each step of the algorithm, this hybrid structure can improve prediction accuracy, and the tuning can be directed at specific business objectives. We compare the predictive performance of this hybrid tree-based structure with that of the traditional Tweedie model using both real and synthetic datasets. Our empirical results show that these hybrid tree-based models produce more accurate predictions without losing intuitive interpretation.
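The two-step structure described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes scikit-learn's `DecisionTreeClassifier` and `ElasticNet`, and the class name `HybridTreeSketch`, the hyperparameter values, the `min_claims` fallback, and the choice to combine the two parts as claim probability times predicted severity are all assumptions introduced here. The data is synthetic and exists only to make the sketch runnable.

```python
import numpy as np
from sklearn.linear_model import ElasticNet
from sklearn.tree import DecisionTreeClassifier


class HybridTreeSketch:
    """Hypothetical sketch: classification tree for claim occurrence,
    then one elastic net per terminal node for claim severity."""

    def __init__(self, max_depth=3, min_samples_leaf=50,
                 alpha=0.1, l1_ratio=0.5, min_claims=10):
        self.tree = DecisionTreeClassifier(
            max_depth=max_depth, min_samples_leaf=min_samples_leaf, random_state=0
        )
        self.alpha = alpha
        self.l1_ratio = l1_ratio
        self.min_claims = min_claims  # assumed threshold for fitting a leaf model
        self.leaf_models = {}

    def fit(self, X, y):
        X, y = np.asarray(X, dtype=float), np.asarray(y, dtype=float)
        # Step 1: classify whether a policy has a claim (y > 0) or not.
        self.tree.fit(X, (y > 0).astype(int))
        leaves = self.tree.apply(X)  # terminal-node id for each row
        # Step 2: fit an elastic net on the positive claims within each leaf.
        for leaf in np.unique(leaves):
            mask = (leaves == leaf) & (y > 0)
            if mask.sum() >= self.min_claims:
                model = ElasticNet(alpha=self.alpha, l1_ratio=self.l1_ratio)
                self.leaf_models[leaf] = model.fit(X[mask], y[mask])
            else:
                # Too few claims to regress: fall back to the leaf mean (or 0).
                self.leaf_models[leaf] = float(y[mask].mean()) if mask.any() else 0.0
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        leaves = self.tree.apply(X)
        classes = list(self.tree.classes_)
        proba = self.tree.predict_proba(X)
        p_claim = proba[:, classes.index(1)] if 1 in classes else np.zeros(len(X))
        severity = np.zeros(len(X))
        for leaf in np.unique(leaves):
            idx = leaves == leaf
            model = self.leaf_models[leaf]
            severity[idx] = model if isinstance(model, float) else model.predict(X[idx])
        # Assumed combination: expected loss = P(claim) * severity, floored at 0.
        return p_claim * np.clip(severity, 0.0, None)


# Synthetic, zero-heavy data for illustration only.
rng = np.random.default_rng(0)
n = 2000
X = rng.normal(size=(n, 4))
has_claim = rng.random(n) < 1.0 / (1.0 + np.exp(-(X[:, 0] - 1.5)))
amount = rng.gamma(shape=2.0, scale=np.exp(0.3 * X[:, 1] + 6.0) / 2.0)
y = np.where(has_claim, amount, 0.0)

model = HybridTreeSketch().fit(X, y)
pred = model.predict(X)
```

In this sketch the tree hyperparameters (`max_depth`, `min_samples_leaf`) and the elastic net hyperparameters (`alpha`, `l1_ratio`) are separate, which is what makes it possible to tune each step on its own, for example with cross-validation against a metric chosen for the business objective.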


research
07/15/2023

CatBoost Versus XGBoost and LightGBM: Developing Enhanced Predictive Models for Zero-Inflated Insurance Claim Data

In the property and casualty insurance industry, some challenges are pre...
research
07/05/2023

Knowledge-Guided Additive Modeling For Supervised Regression

Learning processes by exploiting restricted domain knowledge is an impor...
research
07/24/2017

Big Data Regression Using Tree Based Segmentation

Scaling regression to large datasets is a common problem in many applica...
research
03/03/2023

Bayesian CART models for insurance claims frequency

Accuracy and interpretability of a (non-life) insurance pricing model ar...
research
01/19/2021

The effect of Hybrid Principal Components Analysis on the Signal Compression Functional Regression: With EEG-fMRI Application

Objective: In some situations that exist both scalar and functional data...
research
10/31/2016

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns

In the literature, two series of models have been proposed to address pr...
research
04/24/2018

DeepTriangle: A Deep Learning Approach to Loss Reserving

We propose a novel approach for loss reserving based on deep neural netw...
