Learning Models of Adversarial Agent Behavior under Partial Observability

06/19/2023
by   Sean Ye, et al.
0

The need for opponent modeling and tracking arises in several real-world scenarios, such as professional sports, video game design, and drug-trafficking interdiction. In this work, we present Graph based Adversarial Modeling with Mutal Information (GrAMMI) for modeling the behavior of an adversarial opponent agent. GrAMMI is a novel graph neural network (GNN) based approach that uses mutual information maximization as an auxiliary objective to predict the current and future states of an adversarial opponent with partial observability. To evaluate GrAMMI, we design two large-scale, pursuit-evasion domains inspired by real-world scenarios, where a team of heterogeneous agents is tasked with tracking and interdicting a single adversarial agent, and the adversarial agent must evade detection while achieving its own objectives. With the mutual information formulation, GrAMMI outperforms all baselines in both domains and achieves 31.68 adversarial state predictions across both domains.

READ FULL TEXT

page 1

page 6

research
06/21/2020

Emergent cooperation through mutual information maximization

With artificial intelligence systems becoming ubiquitous in our society,...
research
05/16/2020

Mutual Information Maximization for Robust Plannable Representations

Extending the capabilities of robotics to real-world complex, unstructur...
research
03/10/2021

Hard Attention Control By Mutual Information Maximization

Biological agents have adopted the principle of attention to limit the r...
research
01/16/2020

MIME: Mutual Information Minimisation Exploration

We show that reinforcement learning agents that learn by surprise (surpr...
research
01/07/2019

Distributed Learning with Adversarial Agents Under Relaxed Network Condition

This work studies the problem of non-Bayesian learning over multi-agent ...
research
07/01/2021

Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization

Semantic segmentation is one of the basic, yet essential scene understan...
research
02/25/2019

Stochastic Prediction of Multi-Agent Interactions from Partial Observations

We present a method that learns to integrate temporal information, from ...

Please sign up or login with your details

Forgot password? Click here to reset