Classification and Online Clustering of Zero-Day Malware

05/01/2023
by   Olha Jurečková, et al.
0

A large amount of new malware is constantly being generated, which must not only be distinguished from benign samples, but also classified into malware families. For this purpose, investigating how existing malware families are developed and examining emerging families need to be explored. This paper focuses on the online processing of incoming malicious samples to assign them to existing families or, in the case of samples from new families, to cluster them. We experimented with seven prevalent malware families from the EMBER dataset, with four in the training set and three additional new families in the test set. Based on the classification score of the multilayer perceptron, we determined which samples would be classified and which would be clustered into new malware families. We classified 97.21 accuracy of 95.33 self-organizing map, achieving a purity from 47.61 for ten clusters. These results indicate that our approach has the potential to be applied to the classification and clustering of zero-day malware into malware families.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2021

Cluster Analysis of Malware Family Relationships

In this paper, we use K-means clustering to analyze various relationship...
research
10/28/2022

A Deep Dive into VirusTotal: Characterizing and Clustering a Massive File Feed

Online scanners analyze user-submitted files with a large number of secu...
research
11/06/2017

Computer activity learning from system call time series

Using a previously introduced similarity function for the stream of syst...
research
02/28/2021

Virus-MNIST: A Benchmark Malware Dataset

The short note presents an image classification dataset consisting of 10...
research
01/29/2019

Throttling Malware Families in 2D

Malicious software are categorized into families based on their static a...
research
05/02/2023

CNS-Net: Conservative Novelty Synthesizing Network for Malware Recognition in an Open-set Scenario

We study the challenging task of malware recognition on both known and n...

Please sign up or login with your details

Forgot password? Click here to reset