Towards interpreting ML-based automated malware detection models: a survey

01/15/2021
by   Yuzhou Lin, et al.
0

Malware is being increasingly threatening and malware detectors based on traditional signature-based analysis are no longer suitable for current malware detection. Recently, the models based on machine learning (ML) are developed for predicting unknown malware variants and saving human strength. However, most of the existing ML models are black-box, which made their pre-diction results undependable, and therefore need further interpretation in order to be effectively deployed in the wild. This paper aims to examine and categorize the existing researches on ML-based malware detector interpretability. We first give a detailed comparison over the previous work on common ML model inter-pretability in groups after introducing the principles, attributes, evaluation indi-cators and taxonomy of common ML interpretability. Then we investigate the interpretation methods towards malware detection, by addressing the importance of interpreting malware detectors, challenges faced by this field, solutions for migitating these challenges, and a new taxonomy for classifying all the state-of-the-art malware detection interpretability work in recent years. The highlight of our survey is providing a new taxonomy towards malware detection interpreta-tion methods based on the common taxonomy summarized by previous re-searches in the common field. In addition, we are the first to evaluate the state-of-the-art approaches by interpretation method attributes to generate the final score so as to give insight to quantifying the interpretability. By concluding the results of the recent researches, we hope our work can provide suggestions for researchers who are interested in the interpretability on ML-based malware de-tection models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2020

Interpreting Machine Learning Malware Detectors Which Leverage N-gram Analysis

In cyberattack detection and prevention systems, cybersecurity analysts ...
research
01/03/2023

Analysis of Label-Flip Poisoning Attack on Machine Learning Based Malware Detector

With the increase in machine learning (ML) applications in different dom...
research
08/30/2021

ML-based IoT Malware Detection Under Adversarial Settings: A Systematic Evaluation

The rapid growth of the Internet of Things (IoT) devices is paralleled b...
research
09/15/2019

I-MAD: A Novel Interpretable Malware Detector Using Hierarchical Transformer

Malware imposes tremendous threats to computer users nowadays. Since sig...
research
03/21/2021

Towards Improving the Trustworthiness of Hardware based Malware Detector using Online Uncertainty Estimation

Hardware-based Malware Detectors (HMDs) using Machine Learning (ML) mode...
research
11/15/2022

The Lean Data Scientist: Recent Advances towards Overcoming the Data Bottleneck

Machine learning (ML) is revolutionizing the world, affecting almost eve...
research
03/28/2023

A Survey on Malware Detection with Graph Representation Learning

Malware detection has become a major concern due to the increasing numbe...

Please sign up or login with your details

Forgot password? Click here to reset