Shellcode_IA32: A Dataset for Automatic Shellcode Generation

04/27/2021
by   Pietro Liguori, et al.
0

We take the first step to address the task of automatically generating shellcodes, i.e., small pieces of code used as a payload in the exploitation of a software vulnerability, starting from natural language comments. We assemble and release a novel dataset (Shellcode_IA32), consisting of challenging but common assembly instructions with their natural language descriptions. We experiment with standard methods in neural machine translation (NMT) to establish baseline performance levels on this task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2022

Can We Generate Shellcodes via Natural Language? An Empirical Study

Writing software exploits is an important practice for offensive securit...
research
10/03/2020

Code to Comment "Translation": Data, Metrics, Baselining Evaluation

The relationship of comments to code, and in particular, the task of gen...
research
09/01/2021

EVIL: Exploiting Software via Natural Language

Writing exploits for security assessment is a challenging task. The writ...
research
12/12/2022

Who Evaluates the Evaluators? On Automatic Metrics for Assessing AI-based Offensive Code Generators

AI-based code generators are an emerging solution for automatically writ...
research
02/08/2022

Towards Property-Based Tests in Natural Language

We consider a new approach to generate tests from natural language. Rath...
research
03/29/2022

Can NMT Understand Me? Towards Perturbation-based Evaluation of NMT Models for Code Generation

Neural Machine Translation (NMT) has reached a level of maturity to be r...
research
06/21/2019

Neural Machine Translating from Natural Language to SPARQL

SPARQL is a highly powerful query language for an ever-growing number of...

Please sign up or login with your details

Forgot password? Click here to reset