Playing Adaptively Against Stealthy Opponents: A Reinforcement Learning Strategy for the FlipIt Security Game

06/27/2019
by   Lisa Oakley, et al.
0

A rise in Advanced Persistant Threats (APTs) has introduced a need for robustness against long-running, stealthy attacks which circumvent existing cryptographic security guarantees. FlipIt is a security game that models the attacker-defender interactions in advanced scenarios such as APTs. Previous work analyzed extensively non-adaptive strategies in FlipIt, but adaptive strategies rise naturally in practical interactions as players receive feedback during the game. We model the FlipIt game as a Markov Decision Process and use reinforcement learning algorithms to design adaptive strategies. We prove theoretical results on the convergence of our new strategy against an opponent playing with a Periodic strategy. We confirm our analysis experimentally by extensive evaluation of the strategy against specific opponents. Our strategies converge to the optimal adaptive strategy for Periodic and Exponential opponents. Finally, we introduce a generalized Q-Learning strategy with composite states that outperforms a Greedy-based strategy for several distributions, including Periodic and Uniform, without prior knowledge of the opponent's strategy.

READ FULL TEXT
research
06/27/2019

QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game

A rise in Advanced Persistent Threats (APTs) has introduced a need for r...
research
02/28/2020

Reinforcement Learning in FlipIt

Reinforcement learning has shown much success in games such as chess, ba...
research
11/20/2020

Continuous Blackjack: Equilibrium, Deviation and Adaptive Strategy

We introduce a variant of the classic poker game blackjack – the continu...
research
07/21/2019

Online Constraint Satisfaction via Tolls in MDP Congestion Games

We consider the toll design problem that arise for a game designer of a ...
research
05/26/2020

Periodic Strategies II: Generalizations and Extensions

At a mixed Nash equilibrium, the payoff of a player does not depend on h...
research
11/28/2017

Reinforcement Mechanism Design, with Applications to Dynamic Pricing in Sponsored Search Auctions

In this study, we apply reinforcement learning techniques and propose wh...
research
05/02/2023

An Adaptive Behaviour-Based Strategy for SARs interacting with Older Adults with MCI during a Serious Game Scenario

The monotonous nature of repetitive cognitive training may cause losing ...

Please sign up or login with your details

Forgot password? Click here to reset