A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games

10/04/2020
by   Shiva Navabi, et al.
0

We consider a pursuit-evasion game [11] played between two agents, 'Blue' (the pursuer) and 'Red' (the evader), over T time steps. Red aims to attack Blue's territory. Blue's objective is to intercept Red by time T and thereby limit the success of Red's attack. Blue must plan its pursuit trajectory by choosing parameters that determine its course of movement (speed and angle in our setup) such that it intercepts Red by time T. We show that Blue's path-planning problem in pursuing Red, can be posed as a sequential decision making problem under uncertainty. Blue's unawareness of Red's action policy renders the analytic dynamic programming approach intractable for finding the optimal action policy for Blue. In this work, we are interested in exploring data-driven approaches to the policy optimization problem that Blue faces. We apply generative machine learning (ML) approaches to learn optimal action policies for Blue. This highlights the ability of generative ML model to learn the relevant implicit representations for the dynamics of simulated pursuit-evasion games. We demonstrate the effectiveness of our modeling approach via extensive statistical assessments. This work can be viewed as a preliminary step towards further adoption of generative modeling approaches for addressing policy optimization problems that arise in the context of multi-agent learning and planning [1].

READ FULL TEXT
research
11/11/2018

Thompson Sampling for Pursuit-Evasion Problems

Pursuit-evasion is a multi-agent sequential decision problem wherein a g...
research
10/19/2019

Optimal Immunization Policy Using Dynamic Programming

Decisions in public health are almost always made in the context of unce...
research
04/03/2021

A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn

Pursuit-evasion games are ubiquitous in nature and in an artificial worl...
research
08/12/2020

Optimizing fire allocation in a NCW-type model

In this paper, we introduce a non-linear Lanchester model of NCW-type an...
research
10/25/2022

UNIFY: a Unified Policy Designing Framework for Solving Constrained Optimization Problems with Machine Learning

The interplay between Machine Learning (ML) and Constrained Optimization...
research
06/16/2022

Planning and Formulations in Pursuit-Evasion: Keep-away Games and Their Strategies

We study a pursuit-evasion problem which can be viewed as an extension o...
research
07/18/2020

Languages for modeling the RED active queue management algorithms: Modelica vs. Julia

This work is devoted to the study of the capabilities of the Modelica an...

Please sign up or login with your details

Forgot password? Click here to reset