Boosting insights in insurance tariff plans with tree-based machine learning

04/12/2019
by   Roel Henckaerts, et al.
0

Pricing actuaries typically stay within the framework of generalized linear models (GLMs). With the upswing of data analytics, our study puts focus on machine learning to develop full tariff plans built from both the frequency and severity of claims. We adapt the loss functions used in the algorithms such that the specific characteristics of insurance data are carefully incorporated: highly unbalanced count data with excess zeros and varying exposure on the frequency side combined with scarce, but potentially long-tailed data on the severity side. A key requirement is the need for transparent and interpretable pricing models which are easily explainable to all stakeholders. We therefore focus on machine learning with decision trees: starting from simple regression trees, we work towards more advanced ensembles such as random forests and boosted trees. We show how to choose the optimal tuning parameters for these models in an elaborate cross-validation scheme, we present visualization tools to obtain insights from the resulting models and the economic value of these new modeling approaches is evaluated. Boosted trees outperform the classical GLMs, allowing the insurer to form profitable portfolios and to guard against potential adverse selection risks.

READ FULL TEXT

page 12

page 18

page 21

research
03/03/2023

Bayesian CART models for insurance claims frequency

Accuracy and interpretability of a (non-life) insurance pricing model ar...
research
04/19/2021

The Impact of Hyper-Parameter Tuning for Landscape-Aware Performance Regression and Algorithm Selection

Automated algorithm selection and configuration methods that build on ex...
research
03/28/2021

Symbolic regression outperforms other models for small data sets

Machine learning is often applied to obtain predictions and new understa...
research
12/01/2021

VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees

Bagging and boosting are two popular ensemble methods in machine learnin...
research
10/10/2018

Equality Constrained Decision Trees: For the Algorithmic Enforcement of Group Fairness

Fairness, through its many forms and definitions, has become an importan...
research
09/23/2020

Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Health insurance companies cover half of the United States population th...
research
11/02/2022

On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

Interpretable and explainable machine learning has seen a recent surge o...

Please sign up or login with your details

Forgot password? Click here to reset