Classifying variety of customer's online engagement for churn prediction with mixed-penalty logistic regression

05/17/2021
by Petra Posedel Šimović, et al.

Analyzing consumer behavior with big data can provide effective decision-making tools for preventing customer attrition (churn) in customer relationship management (CRM). We focus on a CRM dataset covering several categories of factors that drive customer heterogeneity (i.e., usage of self-care service channels, duration of service, and responsiveness to marketing actions). On this dataset, we provide new predictive analytics of the customer churn rate based on a machine learning method that enhances logistic regression classification by adding a mixed penalty term. The proposed penalized logistic regression guards against overfitting on big data and minimizes the loss function while balancing the cost of the median (absolute value) and mean (squared value) regularization. We show the analytical properties of the proposed method and its computational advantage. In addition, we investigate the performance of the proposed method on a CRM dataset with a large number of features under different settings, efficiently eliminating the disturbance from (1) the least important features and (2) sensitivity to the minority (churn) class. Our empirical results confirm the expected performance of the proposed method on the common classification criteria (i.e., accuracy, precision, and recall) for evaluating machine learning methods.
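
To make the idea of a mixed penalty concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not give the exact form of the penalty, so the sketch assumes the familiar elastic-net combination of an absolute-value (L1) term and a squared-value (L2) term, fitted by proximal gradient descent on synthetic data. The names `fit_mixed_penalty`, `lam`, and `alpha`, and the data-generating rule, are hypothetical and chosen only for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def predict_proba(w, b, x):
    """Predicted probability of the positive (churn) class for one row x."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def fit_mixed_penalty(X, y, lam=0.1, alpha=0.5, lr=0.5, n_iter=300):
    """Assumed objective (elastic-net form, not taken from the paper):
    mean log-loss + lam * (alpha * ||w||_1 + (1 - alpha) / 2 * ||w||_2^2).
    The intercept b is left unpenalized.
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi
            for j in range(d):
                grad_w[j] += err * xi[j] / n
            grad_b += err / n
        for j in range(d):
            # Gradient step on the smooth part (log-loss plus squared penalty),
            # then soft-thresholding to handle the absolute-value penalty.
            step = w[j] - lr * (grad_w[j] + lam * (1.0 - alpha) * w[j])
            w[j] = soft_threshold(step, lr * lam * alpha)
        b -= lr * grad_b
    return w, b


# Synthetic stand-in for a CRM table: 5 features, only the first two informative.
rng = random.Random(0)
X = [[rng.gauss(0.0, 1.0) for _ in range(5)] for _ in range(200)]
y = [1 if 2.0 * x[0] - 1.5 * x[1] + rng.gauss(0.0, 0.5) > 0 else 0 for x in X]

w, b = fit_mixed_penalty(X, y, lam=0.1, alpha=0.9)
accuracy = sum(
    (predict_proba(w, b, x) >= 0.5) == (yi == 1) for x, yi in zip(X, y)
) / len(y)
print("weights:", [round(wj, 3) for wj in w])
print("training accuracy:", round(accuracy, 3))
```

In this sketch, `alpha` sets the balance between the two penalty terms: values near 1 favor the absolute-value term, which can drive the weights of unimportant features exactly to zero, while values near 0 favor the squared term, which shrinks all weights smoothly.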


research
06/18/2021

Scalable Econometrics on Big Data – The Logistic Regression on Spark

Extra-large datasets are becoming increasingly accessible, and computing...
research
04/27/2016

Local Uncertainty Sampling for Large-Scale Multi-Class Logistic Regression

A major challenge for building statistical models in the big data era is...
research
10/25/2014

An Aggregation Method for Sparse Logistic Regression

L_1 regularized logistic regression has now become a workhorse of data m...
research
09/09/2020

Regularised Text Logistic Regression: Key Word Detection and Sentiment Classification for Online Reviews

Online customer reviews have become important for managers and executive...
research
01/13/2015

Random Bits Regression: a Strong General Predictor for Big Data

To improve accuracy and speed of regressions and classifications, we pre...
research
01/02/2019

An Automatic Interaction Detection Hybrid Model for Bankcard Response Classification

In this paper, we propose a hybrid bankcard response model, which integr...
research
03/12/2022

A combined approach to the analysis of speech conversations in a contact center domain

The ever more accurate search for deep analysis in customer data is a re...
