Visibility Optimization for Surveillance-Evasion Games

10/18/2020
by   Louis Ly, et al.
0

We consider surveillance-evasion differential games, where a pursuer must try to constantly maintain visibility of a moving evader. The pursuer loses as soon as the evader becomes occluded. Optimal controls for game can be formulated as a Hamilton-Jacobi-Isaac equation. We use an upwind scheme to compute the feedback value function, corresponding to the end-game time of the differential game. Although the value function enables optimal controls, it is prohibitively expensive to compute, even for a single pursuer and single evader on a small grid. We consider a discrete variant of the surveillance-game. We propose two locally optimal strategies based on the static value function for the surveillance-evasion game with multiple pursuers and evaders. We show that Monte Carlo tree search and self-play reinforcement learning can train a deep neural network to generate reasonable strategies for on-line game play. Given enough computational resources and offline training time, the proposed model can continue to improve its policies and efficiently scale to higher resolutions.

READ FULL TEXT
research
12/31/2019

Numerical approximation of the value of a stochastic differential game with asymmetric information

We consider a convexity constrained Hamilton-Jacobi-Bellman-type obstacl...
research
03/08/2019

Learning Self-Game-Play Agents for Combinatorial Optimization Problems

Recent progress in reinforcement learning (RL) using self-game-play has ...
research
06/08/2022

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret

Recent techniques for approximating Nash equilibria in very large games ...
research
11/18/2019

Strategy Synthesis for Surveillance-Evasion Games with Learning-Enabled Visibility Optimization

This paper studies a two-player game with a quantitative surveillance re...
research
05/13/2021

Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning

AlphaZero has achieved impressive performance in deep reinforcement lear...
research
09/17/2018

Autonomous Exploration, Reconstruction, and Surveillance of 3D Environments Aided by Deep Learning

We study the problem of visibility-based exploration, reconstruction and...

Please sign up or login with your details

Forgot password? Click here to reset