Fast and Efficient Malware Detection with Joint Static and Dynamic Features Through Transfer Learning

11/25/2022
by   Mao V. Ngo, et al.
0

In malware detection, dynamic analysis extracts the runtime behavior of malware samples in a controlled environment and static analysis extracts features using reverse engineering tools. While the former faces the challenges of anti-virtualization and evasive behavior of malware samples, the latter faces the challenges of code obfuscation. To tackle these drawbacks, prior works proposed to develop detection models by aggregating dynamic and static features, thus leveraging the advantages of both approaches. However, simply concatenating dynamic and static features raises an issue of imbalanced contribution due to the heterogeneous dimensions of feature vectors to the performance of malware detection models. Yet, dynamic analysis is a time-consuming task and requires a secure environment, leading to detection delays and high costs for maintaining the analysis infrastructure. In this paper, we first introduce a method of constructing aggregated features via concatenating latent features learned through deep learning with equally-contributed dimensions. We then develop a knowledge distillation technique to transfer knowledge learned from aggregated features by a teacher model to a student model trained only on static features and use the trained student model for the detection of new malware samples. We carry out extensive experiments with a dataset of 86709 samples including both benign and malware samples. The experimental results show that the teacher model trained on aggregated features constructed by our method outperforms the state-of-the-art models with an improvement of up to 2.38 student model not only achieves high performance (97.81 as that of the teacher model but also significantly reduces the detection time (from 70046.6 ms to 194.9 ms) without requiring dynamic analysis.

READ FULL TEXT
research
03/12/2019

Agent-based Vs Agent-less Sandbox for Dynamic Behavioral Analysis

Malicious software is detected and classified by either static analysis ...
research
12/24/2019

Integration of Static and Dynamic Analysis for Malware Family Classification with Composite Neural Network

Deep learning has been used in the research of malware analysis. Most cl...
research
07/27/2023

Decoding the Secrets of Machine Learning in Malware Classification: A Deep Dive into Datasets, Feature Extraction, and Model Performance

Many studies have proposed machine-learning (ML) models for malware dete...
research
03/13/2022

A Comparison of Static, Dynamic, and Hybrid Analysis for Malware Detection

In this research, we compare malware detection techniques based on stati...
research
01/26/2021

Malware Detection Using Frequency Domain-Based Image Visualization and Deep Learning

We propose a novel method to detect and visualize malware through image ...
research
11/08/2021

OMD: Orthogonal Malware Detection Using Audio, Image, and Static Features

With the growing number of malware and cyber attacks, there is a need fo...
research
06/23/2023

Full Transparency in DBI frameworks

Following the increasing trends of malicious applications or cyber threa...

Please sign up or login with your details

Forgot password? Click here to reset