Telematics Combined Actuarial Neural Networks for Cross-Sectional and Longitudinal Claim Count Data

by   Francis Duval, et al.

We present novel cross-sectional and longitudinal claim count models for vehicle insurance built upon the Combined Actuarial Neural Network (CANN) framework proposed by Mario Wüthrich and Michael Merz. The CANN approach combines a classical actuarial model, such as a generalized linear model, with a neural network. This blending of models results in a two-component model comprising a classical regression model and a neural network part. The CANN model leverages the strengths of both components, providing a solid foundation and interpretability from the classical model while harnessing the flexibility and capacity to capture intricate relationships and interactions offered by the neural network. In our proposed models, we use well-known log-linear claim count regression models for the classical regression part and a multilayer perceptron (MLP) for the neural network part. The MLP part is used to process telematics car driving data given as a vector characterizing the driving behavior of each insured driver. In addition to the Poisson and negative binomial distributions for cross-sectional data, we propose a procedure for training our CANN model with a multivariate negative binomial (MVNB) specification. By doing so, we introduce a longitudinal model that accounts for the dependence between contracts from the same insured. Our results reveal that the CANN models exhibit superior performance compared to log-linear models that rely on manually engineered telematics features.


Bayesian Modeling of Nonlinear Poisson Regression with Artificial Neural Networks

Being in the era of big data, modeling and prediction of count data have...

Bayesian generalized linear model for over and under dispersed counts

Bayesian models that can handle both over and under dispersed counts are...

Review of Probability Distributions for Modeling Count Data

Count data take on non-negative integer values and are challenging to pr...

CatBoost Versus XGBoost and LightGBM: Developing Enhanced Predictive Models for Zero-Inflated Insurance Claim Data

In the property and casualty insurance industry, some challenges are pre...

Multilevel Conditional Autoregressive models for longitudinal and spatially referenced epidemiological data

The classical multilevel model fails to capture the proximity effect in ...

Statistical Models for the Number of Successful Cyber Intrusions

We propose several generalized linear models (GLMs) to predict the numbe...

Conditional expectation network for SHAP

A very popular model-agnostic technique for explaining predictive models...

Please sign up or login with your details

Forgot password? Click here to reset