Getting pwn'd by AI: Penetration Testing with Large Language Models

07/24/2023
by   Andreas Happe, et al.
0

The field of software security testing, more specifically penetration testing, is an activity that requires high levels of expertise and involves many manual testing and analysis steps. This paper explores the potential usage of large-language models, such as GPT3.5, to augment penetration testers with AI sparring partners. We explore the feasibility of supplementing penetration testers with AI models for two distinct use cases: high-level task planning for security testing assignments and low-level vulnerability hunting within a vulnerable virtual machine. For the latter, we implemented a closed-feedback loop between LLM-generated low-level actions with a vulnerable virtual machine (connected through SSH) and allowed the LLM to analyze the machine state for vulnerabilities and suggest concrete attack vectors which were automatically executed within the virtual machine. We discuss promising initial results, detail avenues for improvement, and close deliberating on the ethics of providing AI-based sparring partners.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2023

Towards Autonomous Testing Agents via Conversational Large Language Models

Software testing is an important part of the development cycle, yet it r...
research
04/07/2022

Transformer-Based Language Models for Software Vulnerability Detection: Performance, Model's Security and Platforms

The large transformer-based language models demonstrate excellent perfor...
research
07/13/2023

SecureFalcon: The Next Cyber Reasoning System for Cyber Security

Software vulnerabilities leading to various detriments such as crashes, ...
research
08/13/2023

Ground Manipulator Primitive Tasks to Executable Actions using Large Language Models

Layered architectures have been widely used in robot systems. The majori...
research
09/21/2023

HiCRISP: A Hierarchical Closed-Loop Robotic Intelligent Self-Correction Planner

The integration of Large Language Models (LLMs) into robotics has revolu...
research
05/04/2023

"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process

As the popularity of large language models (LLMs) soars across various a...
research
12/31/2022

Action Codes

We provide a new perspective on the problem how high-level state machine...

Please sign up or login with your details

Forgot password? Click here to reset