Evaluating resampling methods on a real-life highly imbalanced online credit card payments dataset

Many difficulties in machine-learning-based credit card fraud detection stem from the imbalance of transaction datasets. The number of frauds is tiny compared to the number of regular transactions, and this imbalance has been shown to damage learning performance; at worst, the algorithm learns to classify every transaction as regular. Resampling methods and cost-sensitive approaches are known to be good candidates for mitigating this issue. This paper evaluates numerous state-of-the-art resampling methods on a large real-life online credit card payments dataset. We show that they are inefficient, either because the methods are intractable or because the metrics do not exhibit substantial improvements. Our work contributes to this domain in that (1) we compare many state-of-the-art resampling methods on a large-scale dataset and (2) we use a real-life online credit card payments dataset.
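To make the imbalance problem concrete, the following minimal sketch shows why a classifier that labels every transaction as regular looks accurate while detecting no fraud, and how random undersampling (one of the simplest resampling methods, not necessarily one evaluated in the paper) rebalances the labels. The data is synthetic, and the helper names `random_undersample` and `accuracy_and_recall` are hypothetical, introduced only for illustration.

```python
import random


def random_undersample(labels, ratio=1.0, seed=0):
    """Return sorted indices keeping every fraud row (label 1) and a random
    subset of regular rows (label 0); `ratio` is regular rows kept per fraud row."""
    rng = random.Random(seed)
    fraud = [i for i, y in enumerate(labels) if y == 1]
    regular = [i for i, y in enumerate(labels) if y == 0]
    n_keep = min(len(regular), int(ratio * len(fraud)))
    return sorted(fraud + rng.sample(regular, n_keep))


def accuracy_and_recall(y_true, y_pred):
    """Compute accuracy and recall on the fraud class (label 1)."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    true_pos = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    positives = sum(y_true)
    recall = true_pos / positives if positives else 0.0
    return correct / len(y_true), recall


# Synthetic labels (an assumption for illustration): 10 frauds in 10,000 transactions.
labels = [1] * 10 + [0] * 9990

# Degenerate classifier that predicts "regular" for everything.
all_regular = [0] * len(labels)
acc, rec = accuracy_and_recall(labels, all_regular)

# Undersample the regular class down to a 1:1 ratio with the fraud class.
kept = random_undersample(labels, ratio=1.0)
balanced = [labels[i] for i in kept]
```

Here the degenerate classifier reaches very high accuracy with zero recall on fraud, which is why accuracy alone is a poor metric on such data. Undersampling balances the classes but discards most regular transactions, so whether it actually improves fraud detection has to be checked empirically.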


