Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection

12/04/2017
by   Ruimin Sun, et al.
0

In this paper, we introduce and evaluate PROPEDEUTICA, a novel methodology and framework for efficient and effective real-time malware detection, leveraging the best of conventional machine learning (ML) and deep learning (DL) algorithms. In PROPEDEUTICA, all software processes in the system start execution subjected to a conventional ML detector for fast classification. If a piece of software receives a borderline classification, it is subjected to further analysis via more performance expensive and more accurate DL methods, via our newly proposed DL algorithm DEEPMALWARE. Further, we introduce delays to the execution of software subjected to deep learning analysis as a way to "buy time" for DL analysis and to rate-limit the impact of possible malware in the system. We evaluated PROPEDEUTICA with a set of 9,115 malware samples and 877 commonly used benign software samples from various categories for the Windows OS. Our results show that the false positive rate for conventional ML methods can reach 20 However, the classification time for DL can be 100X longer than conventional ML methods. PROPEDEUTICA improved the detection F1-score from 77.54 ML method) to 90.25 percentage of software subjected to DL analysis was approximately 40 average. Further, the application of delays in software subjected to ML reduced the detection time by approximately 10 discrepancy between the detection accuracy offline (analysis after all traces are collected) and on-the-fly (analysis in tandem with trace collection). Our insights show that conventional ML and modern DL-based malware detectors in isolation cannot meet the needs of efficient and effective malware detection: high accuracy, low false positive rate, and short classification time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2018

Leveraging Uncertainty for Effective Malware Mitigation

A promising avenue for improving the effectiveness of behavioral-based m...
research
01/22/2021

A novel DL approach to PE malware detection: exploring Glove vectorization, MCC_RCNN and feature fusion

In recent years, malware becomes more threatening. Concerning the increa...
research
08/09/2021

Leveraging Uncertainty for Improved Static Malware Detection Under Extreme False Positive Constraints

The detection of malware is a critical task for the protection of comput...
research
04/17/2018

Fast Flux Detection via Data Mining on Passive DNS Traffic

In the last decade, the use of fast flux technique has become establishe...
research
01/08/2018

HeNet: A Deep Learning Approach on Intel^ Processor Trace for Effective Exploit Detection

This paper presents HeNet, a hierarchical ensemble neural network, appli...
research
03/13/2020

Towards a Resilient Machine Learning Classifier – a Case Study of Ransomware Detection

The damage caused by crypto-ransomware, due to encryption, is difficult ...
research
06/29/2018

Topology classification with deep learning to improve real-time event selection at the LHC

We show how event topology classification based on deep learning could b...

Please sign up or login with your details

Forgot password? Click here to reset