Decision-making with Imaginary Opponent Models

11/22/2022
by   Jing Sun, et al.
0

Opponent modeling has benefited a controlled agent's decision-making by constructing models of other agents. Existing methods commonly assume access to opponents' observations and actions, which is infeasible when opponents' behaviors are unobservable or hard to obtain. We propose a novel multi-agent distributional actor-critic algorithm to achieve imaginary opponent modeling with purely local information (i.e., the controlled agent's observations, actions, and rewards). Specifically, the actor maintains a speculated belief of the opponents, which we call the imaginary opponent models, to predict opponents' actions using local observations and makes decisions accordingly. Further, the distributional critic models the return distribution of the policy. It reflects the quality of the actor and thus can guide the training of the imaginary opponent model that the actor relies on. Extensive experiments confirm that our method successfully models opponents' behaviors without their data and delivers superior performance against baseline methods with a faster convergence speed.

READ FULL TEXT
research
01/10/2023

Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework

In this paper, we propose actor-director-critic, a new framework for dee...
research
01/03/2022

Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms

We present sufficient conditions that ensure convergence of the multi-ag...
research
06/16/2020

Local Information Opponent Modelling Using Variational Autoencoders

Modelling the behaviours of other agents (opponents) is essential for un...
research
10/06/2021

Can an AI agent hit a moving target?

As the economies we live in are evolving over time, it is imperative tha...
research
04/20/2023

Interpretability for Conditional Coordinated Behavior in Multi-Agent Reinforcement Learning

We propose a model-free reinforcement learning architecture, called dist...
research
11/09/2017

CogSciK: Clustering for Cognitive Science Motivated Decision Making

Computational models of decisionmaking must contend with the variance of...
research
01/03/2022

Monitoring and Anomaly Detection Actor-Critic Based Controlled Sensing

We address the problem of monitoring a set of binary stochastic processe...

Please sign up or login with your details

Forgot password? Click here to reset