A Comparative Performance Analysis of Explainable Machine Learning Models With And Without RFECV Feature Selection Technique Towards Ransomware Classification

12/09/2022
by   Rawshan Ara Mowri, et al.

Ransomware has emerged as one of the major global threats in recent times. The alarming growth in ransomware attacks and the steady appearance of new variants push researchers in this domain to keep examining the distinguishing traits of ransomware and to refine their detection and classification strategies. Among the broad range of behavioral characteristics, Application Programming Interface (API) calls and network behaviors have been widely used as differentiating factors for ransomware detection and classification. Many prior approaches have shown promising results in detecting and classifying ransomware families with these features without applying any feature selection technique. Feature selection, however, is an important step toward an efficient detection or classification Machine Learning model: it reduces the risk of overfitting by removing redundant data, can improve the model's accuracy by eliminating irrelevant features, and reduces training time by shrinking the feature set. A good number of feature selection techniques are already used in different security scenarios to optimize the performance of Machine Learning models. The aim of this study is therefore to present a comparative performance analysis of widely used Supervised Machine Learning models with and without the Recursive Feature Elimination with Cross-Validation (RFECV) feature selection technique for ransomware classification, using API call and network traffic features. The study thereby provides insight into the efficiency of RFECV for ransomware classification, which peers can use as a reference when choosing a feature selection technique in this domain.
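
The with/without comparison described in the abstract can be illustrated with a minimal sketch, assuming scikit-learn is available. It is not the authors' pipeline: the synthetic dataset, the random forest classifier, the fold count, and all variable names are assumptions chosen only for illustration, standing in for the API call and network traffic features the study uses.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFECV
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Assumption: synthetic data standing in for API-call / network-traffic
# features; three classes play the role of ransomware families.
X, y = make_classification(
    n_samples=300, n_features=20, n_informative=5,
    n_redundant=5, n_classes=3, random_state=0,
)

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
# Assumption: a random forest as one example of a supervised model that
# exposes feature importances, which RFECV needs to rank features.
model = RandomForestClassifier(n_estimators=30, random_state=0)

# Baseline: cross-validated accuracy using every feature.
baseline_acc = cross_val_score(model, X, y, cv=cv, scoring="accuracy").mean()

# RFECV: drop the least important feature one at a time and use
# cross-validation to pick the number of features to keep.
selector = RFECV(estimator=model, step=1, cv=cv, scoring="accuracy")
selector.fit(X, y)
X_selected = selector.transform(X)

# Same model, evaluated on the reduced feature set.
selected_acc = cross_val_score(
    model, X_selected, y, cv=cv, scoring="accuracy"
).mean()

print(f"features kept: {selector.n_features_} of {X.shape[1]}")
print(f"accuracy without RFECV: {baseline_acc:.3f}")
print(f"accuracy with RFECV:    {selected_acc:.3f}")
```

Because the selector here is fitted on the same data that is later scored, the second accuracy is optimistically biased. A stricter comparison would wrap the selector and the model in a single scikit-learn `Pipeline` so that selection is repeated inside each evaluation fold.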

research
10/16/2022

Interpretable Machine Learning for Detection and Classification of Ransomware Families Based on API Calls

Ransomware has appeared as one of the major global threats in recent day...
research
04/19/2018

A comparative study of feature selection methods for stress hotspot classification in materials

The first step in constructing a machine learning model is defining the ...
research
12/07/2022

Fallen Angel Bonds Investment and Bankruptcy Predictions Using Manual Models and Automated Machine Learning

The primary aim of this research was to find a model that best predicts ...
research
08/28/2022

Classification and Detection of Mesothelioma Cancer Using Feature Selection-Enabled Machine Learning Technique

Cancer of the mesothelium, sometimes referred to as malignant mesothelio...
research
10/02/2019

ConfusionFlow: A model-agnostic visualization for temporal analysis of classifier confusion

Classifiers are among the most widely used supervised machine learning a...
research
12/19/2017

Ensemble Models for Detecting Wikidata Vandalism with Stacking - Team Honeyberry Vandalism Detector at WSDM Cup 2017

The WSDM Cup 2017 is a binary classification task for classifying Wikida...
research
02/25/2019

Epileptic seizure classification using statistical sampling and a novel feature selection algorithm

Epilepsy is a well-known neuronal disorder that can be identified by int...
