ScriptNet: Neural Static Analysis for Malicious JavaScript Detection

04/01/2019
by   Jack W. Stokes, et al.
0

Malicious scripts are an important computer infection threat vector in the wild. For web-scale processing, static analysis offers substantial computing efficiencies. We propose the ScriptNet system for neural malicious JavaScript detection which is based on static analysis. We use the Convoluted Partitioning of Long Sequences (CPoLS) model, which processes Javascript files as byte sequences. Lower layers capture the sequential nature of these byte sequences while higher layers classify the resulting embedding as malicious or benign. Unlike previously proposed solutions, our model variants are trained in an end-to-end fashion allowing discriminative training even for the sequential processing layers. Evaluating this model on a large corpus of 212,408 JavaScript files indicates that the best performing CPoLS model offers a 97.20 true positive rate (TPR) for the first 60K byte subsequence at a false positive rate (FPR) of 0.50 several baseline models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/15/2018

Neural Classification of Malicious Scripts: A study with JavaScript and VBScript

Malicious scripts are an important computer infection threat vector. Our...
research
06/28/2018

Robust Neural Malware Detection Models for Emulation Sequence Learning

Malicious software, or malware, presents a continuously evolving challen...
research
03/02/2020

Graphing Website Relationships for Risk Prediction: Identifying Derived Threats to Users Based on Known Indicators

The hypothesis for the study was that the relationship based on referrer...
research
07/15/2020

Static analysis of executable files by machine learning methods

The paper describes how to detect malicious executable files based on st...
research
04/22/2018

MEADE: Towards a Malicious Email Attachment Detection Engine

Malicious email attachments are a growing delivery vector for malware. W...
research
08/20/2022

Quo Vadis: Hybrid Machine Learning Meta-Model based on Contextual and Behavioral Malware Representations

We propose a hybrid machine learning architecture that simultaneously em...
research
04/13/2018

A Deep Learning Approach to Fast, Format-Agnostic Detection of Malicious Web Content

Malicious web content is a serious problem on the Internet today. In thi...

Please sign up or login with your details

Forgot password? Click here to reset