Tackling Imbalanced Data in Cybersecurity with Transfer Learning: A Case with ROP Payload Detection

05/06/2021
by   Haizhou Wang, et al.
0

In recent years, deep learning gained proliferating popularity in the cybersecurity application domain, since when being compared to traditional machine learning, it usually involves less human effort, produces better results, and provides better generalizability. However, the imbalanced data issue is very common in cybersecurity, which can substantially deteriorate the performance of the deep learning models. This paper introduces a transfer learning based method to tackle the imbalanced data issue in cybersecurity using Return-Oriented Programming (ROP) payload detection as a case study. We achieved 0.033 average false positive rate, 0.9718 average F1 score and 0.9418 average detection rate on 3 different target domain programs using 2 different source domain programs, with 0 benign training data samples in the target domain. The performance improvement compared to the baseline is a trade-off between false positive rate and detection rate. Using our approach, the number of false positives is reduced by 23.20 detected malicious samples is reduced by 0.50

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2021

Detecting Bias in Transfer Learning Approaches for Text Classification

Classification is an essential and fundamental task in machine learning,...
research
12/18/2018

Deep Transfer Learning for Static Malware Classification

We propose to apply deep transfer learning from computer vision to stati...
research
02/12/2023

AIDA: Legal Judgment Predictions for Non-Professional Fact Descriptions via Partial-and-Imbalanced Domain Adaptation

In this paper, we study the problem of legal domain adaptation problem f...
research
01/04/2021

Towards Network Traffic Monitoring Using Deep Transfer Learning

Network traffic is growing at an outpaced speed globally. The modern net...
research
11/14/2021

Improving Compound Activity Classification via Deep Transfer and Representation Learning

Recent advances in molecular machine learning, especially deep neural ne...
research
08/26/2021

Using GAN-based models to sentimental analysis on imbalanced datasets in education domain

While the whole world is still struggling with the COVID-19 pandemic, on...

Please sign up or login with your details

Forgot password? Click here to reset