Neural Networks Optimizations Against Concept and Data Drift in Malware Detection

08/21/2023
by   William Maillet, et al.
0

Despite the promising results of machine learning models in malware detection, they face the problem of concept drift due to malware constant evolution. This leads to a decline in performance over time, as the data distribution of the new files differs from the training one, requiring regular model update. In this work, we propose a model-agnostic protocol to improve a baseline neural network to handle with the drift problem. We show the importance of feature reduction and training with the most recent validation set possible, and propose a loss function named Drift-Resilient Binary Cross-Entropy, an improvement to the classical Binary Cross-Entropy more effective against drift. We train our model on the EMBER dataset (2018) and evaluate it on a dataset of recent malicious files, collected between 2020 and 2023. Our improved model shows promising results, detecting 15.2 than a baseline model.

READ FULL TEXT
research
08/09/2022

Robust Machine Learning for Malware Detection over Time

The presence and persistence of Android malware is an on-going threat th...
research
03/12/2018

Adversarial Malware Binaries: Evading Deep Learning for Malware Detection in Executables

Machine-learning methods have already been exploited as useful tools for...
research
05/24/2022

Fast Furious: Modelling Malware Detection as Evolving Data Streams

Malware is a major threat to computer systems and imposes many challenge...
research
10/08/2020

Transcending Transcend: Revisiting Malware Classification with Conformal Evaluation

Machine learning for malware classification shows encouraging results, b...
research
04/12/2018

EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models

This paper describes EMBER: a labeled benchmark dataset for training mac...
research
02/08/2023

Continuous Learning for Android Malware Detection

Machine learning methods can detect Android malware with very high accur...
research
08/28/2020

A Network-Assisted Approach for Ransomware Detection

Ransomware is a kind of malware using cryptographic mechanisms to preven...

Please sign up or login with your details

Forgot password? Click here to reset