Classifying Sequences of Extreme Length with Constant Memory Applied to Malware Detection

12/17/2020
by   Edward Raff, et al.
14

Recent works within machine learning have been tackling inputs of ever-increasing size, with cybersecurity presenting sequence classification problems of particularly extreme lengths. In the case of Windows executable malware detection, inputs may exceed 100 MB, which corresponds to a time series with T=100,000,000 steps. To date, the closest approach to handling such a task is MalConv, a convolutional neural network capable of processing up to T=2,000,000 steps. The 𝒪(T) memory of CNNs has prevented further application of CNNs to malware. In this work, we develop a new approach to temporal max pooling that makes the required memory invariant to the sequence length T. This makes MalConv 116× more memory efficient, and up to 25.8× faster to train on its original dataset, while removing the input length restrictions to MalConv. We re-invest these gains into improving the MalConv architecture by developing a new Global Channel Gating design, giving us an attention mechanism capable of learning feature interactions across 100 million time steps in an efficient manner, a capability lacked by the original MalConv CNN. Our implementation can be found at https://github.com/NeuromorphicComputationResearchProgram/MalConv2

READ FULL TEXT

page 11

page 12

page 13

research
10/25/2017

Malware Detection by Eating a Whole EXE

In this work we introduce malware detection from raw byte sequences as a...
research
04/26/2021

secml-malware: A Python Library for Adversarial Robustness Evaluation of Windows Malware Classifiers

Machine learning has been increasingly used as a first line of defense f...
research
03/24/2021

CNN vs ELM for Image-Based Malware Classification

Research in the field of malware classification often relies on machine ...
research
05/31/2023

Recasting Self-Attention with Holographic Reduced Representations

In recent years, self-attention has become the dominant paradigm for seq...
research
08/17/2021

Arbitrary-length analogs to de Bruijn sequences

Let α be a length-L cyclic sequence of characters from a size-K alphabet...
research
01/26/2023

New Approach to Malware Detection Using Optimized Convolutional Neural Network

Cyber-crimes have become a multi-billion-dollar industry in the recent y...

Please sign up or login with your details

Forgot password? Click here to reset