Multifamily Malware Models

06/27/2022
by   Samanvitha Basole, et al.
0

When training a machine learning model, there is likely to be a tradeoff between accuracy and the diversity of the dataset. Previous research has shown that if we train a model to detect one specific malware family, we generally obtain stronger results as compared to a case where we train a single model on multiple diverse families. However, during the detection phase, it would be more efficient to have a single model that can reliably detect multiple families, rather than having to score each sample against multiple models. In this research, we conduct experiments based on byte n-gram features to quantify the relationship between the generality of the training dataset and the accuracy of the corresponding machine learning models, all within the context of the malware detection problem. We find that neighborhood-based algorithms generalize surprisingly well, far outperforming the other machine learning techniques considered.

READ FULL TEXT

page 14

page 15

page 16

page 25

research
07/04/2021

Machine Learning for Malware Evolution Detection

Malware evolves over time and antivirus must adapt to such evolution. He...
research
03/24/2021

CNN vs ELM for Image-Based Malware Classification

Research in the field of malware classification often relies on machine ...
research
02/10/2020

Nested Multiple Instance Learning in Modelling of HTTP network traffic

In many interesting cases, the application of machine learning is hinder...
research
07/17/2023

Hidden Markov Models with Random Restarts vs Boosting for Malware Detection

Effective and efficient malware detection is at the forefront of researc...
research
03/13/2022

A Comparison of Static, Dynamic, and Hybrid Analysis for Malware Detection

In this research, we compare malware detection techniques based on stati...
research
03/07/2021

Word Embedding Techniques for Malware Evolution Detection

Malware detection is a critical aspect of information security. One diff...
research
04/13/2020

Local Model Feature Transformations

Local learning methods are a popular class of machine learning algorithm...

Please sign up or login with your details

Forgot password? Click here to reset