Scalable Learning of Intrusion Responses through Recursive Decomposition

09/06/2023
by   Kim Hammar, et al.
0

We study automated intrusion response for an IT infrastructure and formulate the interaction between an attacker and a defender as a partially observed stochastic game. To solve the game we follow an approach where attack and defense strategies co-evolve through reinforcement learning and self-play toward an equilibrium. Solutions proposed in previous work prove the feasibility of this approach for small infrastructures but do not scale to realistic scenarios due to the exponential growth in computational complexity with the infrastructure size. We address this problem by introducing a method that recursively decomposes the game into subgames which can be solved in parallel. Applying optimal stopping theory we show that the best response strategies in these subgames exhibit threshold structures, which allows us to compute them efficiently. To solve the decomposed game we introduce an algorithm called Decompositional Fictitious Self-Play (DFSP), which learns Nash equilibria through stochastic approximation. We evaluate the learned strategies in an emulation environment where real intrusions and response actions can be executed. The results show that the learned strategies approximate an equilibrium and that DFSP significantly outperforms a state-of-the-art algorithm for a realistic infrastructure configuration.

READ FULL TEXT

page 1

page 9

research
01/11/2023

Learning Near-Optimal Intrusion Responses Against Dynamic Attackers

We study automated intrusion response and formulate the interaction betw...
research
05/29/2022

Learning Security Strategies through Game Play and Optimal Stopping

We study automated intrusion prevention using reinforcement learning. Fo...
research
09/17/2020

Finding Effective Security Strategies through Reinforcement Learning and Self-Play

We present a method to automatically find security strategies for the us...
research
05/05/2020

Stochastic Stackelberg games

In this paper, we consider a discrete-time stochastic Stackelberg game w...
research
10/30/2021

Intrusion Prevention through Optimal Stopping

We study automated intrusion prevention using reinforcement learning. Fo...
research
01/26/2022

An Efficient Approximation Algorithm for the Colonel Blotto Game

In the storied Colonel Blotto game, two colonels allocate a and b troops...
research
12/11/2011

Adaptive Forgetting Factor Fictitious Play

It is now well known that decentralised optimisation can be formulated a...

Please sign up or login with your details

Forgot password? Click here to reset