NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search

01/17/2022
by   Wanqi Xue, et al.
0

How resources are deployed to secure critical targets in networks can be modelled by Network Security Games (NSGs). While recent advances in deep learning (DL) provide a powerful approach to dealing with large-scale NSGs, DL methods such as NSG-NFSP suffer from the problem of data inefficiency. Furthermore, due to centralized control, they cannot scale to scenarios with a large number of resources. In this paper, we propose a novel DL-based method, NSGZero, to learn a non-exploitable policy in NSGs. NSGZero improves data efficiency by performing planning with neural Monte Carlo Tree Search (MCTS). Our main contributions are threefold. First, we design deep neural networks (DNNs) to perform neural MCTS in NSGs. Second, we enable neural MCTS with decentralized control, making NSGZero applicable to NSGs with many resources. Third, we provide an efficient learning paradigm, to achieve joint training of the DNNs in NSGZero. Compared to state-of-the-art algorithms, our method achieves significantly better data efficiency and scalability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/14/2023

Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications

The advent of AlphaGo and its successors marked the beginning of a new p...
research
06/02/2021

Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play

Securing networked infrastructures is important in the real world. The p...
research
05/31/2019

Multiple Policy Value Monte Carlo Tree Search

Many of the strongest game playing programs use a combination of Monte C...
research
12/14/2020

Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search

Monte Carlo tree search (MCTS) has achieved state-of-the-art results in ...
research
12/18/2020

Which Heroes to Pick? Learning to Draft in MOBA Games with Neural Networks and Tree Search

Hero drafting is essential in MOBA game playing as it builds the team of...
research
07/01/2020

Convex Regularization in Monte-Carlo Tree Search

Monte-Carlo planning and Reinforcement Learning (RL) are essential to se...
research
10/30/2020

Bayesian Optimization Meets Laplace Approximation for Robotic Introspection

In robotics, deep learning (DL) methods are used more and more widely, b...

Please sign up or login with your details

Forgot password? Click here to reset