Deep Learning based Frameworks for Handling Imbalance in DGA, Email, and URL Data Analysis

03/31/2020
by   Simran K, et al.
0

Deep learning is a state of the art method for a lot of applications. The main issue is that most of the real-time data is highly imbalanced in nature. In order to avoid bias in training, cost-sensitive approach can be used. In this paper, we propose cost-sensitive deep learning based frameworks and the performance of the frameworks is evaluated on three different Cyber Security use cases which are Domain Generation Algorithm (DGA), Electronic mail (Email), and Uniform Resource Locator (URL). Various experiments were performed using cost-insensitive as well as cost-sensitive methods and parameters for both of these methods are set based on hyperparameter tuning. In all experiments, the cost-sensitive deep learning methods performed better than the cost-insensitive approaches. This is mainly due to the reason that cost-sensitive approach gives importance to the classes which have a very less number of samples during training and this helps to learn all the classes in a more efficient manner.

READ FULL TEXT
research
10/06/2021

Influence-Balanced Loss for Imbalanced Visual Classification

In this paper, we propose a balancing training method to address problem...
research
02/09/2018

Deep Learning for Malicious Flow Detection

Cyber security has grown up to be a hot issue in recent years. How to id...
research
09/08/2021

Knowledge Learning-based Adaptable System for Sensitive Information Identification and Handling

Diagnostic data such as logs and memory dumps from production systems ar...
research
05/08/2023

A LSTM and Cost-Sensitive Learning-Based Real-Time Warning for Civil Aviation Over-limit

The issue of over-limit during passenger aircraft flights has drawn incr...
research
03/31/2020

Deep Learning Approach for Enhanced Cyber Threat Indicators in Twitter Stream

In recent days, the amount of Cyber Security text data shared via social...
research
09/17/2022

AdaCC: Cumulative Cost-Sensitive Boosting for Imbalanced Classification

Class imbalance poses a major challenge for machine learning as most sup...
research
07/15/2015

Untangling AdaBoost-based Cost-Sensitive Classification. Part II: Empirical Analysis

A lot of approaches, each following a different strategy, have been prop...

Please sign up or login with your details

Forgot password? Click here to reset