Towards Learning Representations of Binary Executable Files for Security Tasks

02/09/2020
by   Shushan Arakelyan, et al.
10

Tackling binary analysis problems has traditionally implied manually defining rules and heuristics. As an alternative, we are suggesting using machine learning models for learning distributed representations of binaries that can be applicable for a number of downstream tasks. We construct a computational graph from the binary executable and use it with a graph convolutional neural network to learn a high dimensional representation of the program. We show the versatility of this approach by using our representations to solve two semantically different binary analysis tasks – algorithm classification and vulnerability discovery. We compare the proposed approach to our own strong baseline as well as published results and demonstrate improvement on the state of the art methods for both tasks.

READ FULL TEXT
research
06/10/2021

Semantic-aware Binary Code Representation with BERT

A wide range of binary analysis applications, such as bug discovery, mal...
research
03/23/2020

ProGraML: Graph-based Deep Learning for Program Optimization and Analysis

The increasing complexity of computing systems places a tremendous burde...
research
10/26/2021

Unbiased Graph Embedding with Biased Graph Observations

Graph embedding techniques have been increasingly employed in real-world...
research
06/03/2019

DANE: Domain Adaptive Network Embedding

Recent works reveal that network embedding techniques enable many machin...
research
11/20/2018

Temporal Graph Offset Reconstruction: Towards Temporally Robust Graph Representation Learning

Graphs are a commonly used construct for representing relationships betw...
research
12/11/2019

Graph Input Representations for Machine Learning Applications in Urban Network Analysis

Understanding and learning the characteristics of network paths has been...
research
09/06/2018

One-shot Learning for iEEG Seizure Detection Using End-to-end Binary Operations: Local Binary Patterns with Hyperdimensional Computing

This paper presents an efficient binarized algorithm for both learning a...

Please sign up or login with your details

Forgot password? Click here to reset