That looks interesting! Personalizing Communication and Segmentation with Random Forest Node Embeddings

09/13/2020
by   Weiwei Wang, et al.
0

Communicating effectively with customers is a challenge for many marketers, but especially in a context that is both pivotal to individual long-term financial well-being and difficult to understand: pensions. Around the world, participants are reluctant to consider their pension in advance, it leads to a lack of preparation of their pension retirement [1], [2]. In order to engage participants to obtain information on their expected pension benefits, personalizing the pension providers' email communication is a first and crucial step. We describe a machine learning approach to model email newsletters to fit participants' interests. The data for the modeling and analysis is collected from newsletters sent by a large Dutch pension provider of the Netherlands and is divided into two parts. The first part comprises 2,228,000 customers whereas the second part comprises the data of a pilot study, which took place in July 2018 with 465,711 participants. In both cases, our algorithm extracts features from continuous and categorical data using random forests, and then calculates node embeddings of the decision boundaries of the random forest. We illustrate the algorithm's effectiveness for the classification task, and how it can be used to perform data mining tasks. In order to confirm that the result is valid for more than one data set, we also illustrate the properties of our algorithm in benchmark data sets concerning churning. In the data sets considered, the proposed modeling demonstrates competitive performance with respect to other state of the art approaches based on random forests, achieving the best Area Under the Curve (AUC) in the pension data set (0.948). For the descriptive part, the algorithm can identify customer segmentations that can be used by marketing departments to better target their communication towards their customers.

READ FULL TEXT
research
03/29/2023

Local Interpretability of Random Forests for Multi-Target Regression

Multi-target regression is useful in a plethora of applications. Althoug...
research
03/28/2021

Symbolic regression outperforms other models for small data sets

Machine learning is often applied to obtain predictions and new understa...
research
10/04/2013

Narrowing the Gap: Random Forests In Theory and In Practice

Despite widespread interest and practical use, the theoretical propertie...
research
11/24/2020

The Application of Data Mining in the Production Processes

Traditional statistical and measurements are unable to solve all industr...
research
04/05/2020

XtracTree for Regulator Validation of Bagging Methods Used in Retail Banking

Bootstrap aggregation, known as bagging, is one of the most popular ense...
research
03/15/2017

Random Forests and VGG-NET: An Algorithm for the ISIC 2017 Skin Lesion Classification Challenge

This manuscript briefly describes an algorithm developed for the ISIC 20...
research
01/05/2023

Random forests, sound symbolism and Pokemon evolution

This study constructs machine learning algorithms that are trained to cl...

Please sign up or login with your details

Forgot password? Click here to reset