SK-Tree: a systematic malware detection algorithm on streaming trees via the signature kernel

02/16/2021
by   Thomas Cochrane, et al.
0

The development of machine learning algorithms in the cyber security domain has been impeded by the complex, hierarchical, sequential and multimodal nature of the data involved. In this paper we introduce the notion of a streaming tree as a generic data structure encompassing a large portion of real-world cyber security data. Starting from host-based event logs we represent computer processes as streaming trees that evolve in continuous time. Leveraging the properties of the signature kernel, a machine learning tool that recently emerged as a leading technology for learning with complex sequences of data, we develop the SK-Tree algorithm. SK-Tree is a supervised learning method for systematic malware detection on streaming trees that is robust to irregular sampling and high dimensionality of the underlying streams. We demonstrate the effectiveness of SK-Tree to detect malicious events on a portion of the publicly available DARPA OpTC dataset, achieving an AUROC score of 98

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2019

Detection of Advanced Malware by Machine Learning Techniques

In today's digital world most of the anti-malware tools are signature ba...
research
01/05/2022

Comprehensive Efficiency Analysis of Machine Learning Algorithms for Developing Hardware-Based Cybersecurity Countermeasures

Modern computing systems have led cyber adversaries to create more sophi...
research
03/30/2022

Dynamic Model Tree for Interpretable Data Stream Learning

Data streams are ubiquitous in modern business and society. In practice,...
research
10/15/2019

Automated Ransomware Behavior Analysis: Pattern Extraction and Early Detection

Security operation centers (SOCs) typically use a variety of tools to co...
research
12/17/2013

Mining Malware Specifications through Static Reachability Analysis

The number of malicious software (malware) is growing out of control. Sy...
research
04/17/2019

Low-Latency Graph Streaming Using Compressed Purely-Functional Trees

Due to the dynamic nature of real-world graphs, there has been a growing...

Please sign up or login with your details

Forgot password? Click here to reset