Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents

09/12/2018
by   Tianpei Yang, et al.
0

Multiagent algorithms often aim to accurately predict the behaviors of other agents and find a best response during interactions accordingly. Previous works usually assume an opponent uses a stationary strategy or randomly switches among several stationary ones. However, in practice, an opponent may exhibit more sophisticated behaviors by adopting more advanced strategies, e.g., using a bayesian reasoning strategy. This paper presents a novel algorithm called Bayes-ToMoP which can efficiently detect and handle opponents using either stationary or higher-level reasoning strategies. Bayes-ToMoP also supports the detection of previous unseen policies and learning a best response policy accordingly. Deep Bayes-ToMoP is proposed by extending Bayes-ToMoP with DRL techniques. Experimental results show both Bayes-ToMoP and deep Bayes-ToMoP outperform the state-of-the-art approaches when faced with different types of opponents in two-agent competitive games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2021

Model-Based Opponent Modeling

When one agent interacts with a multi-agent environment, it is challengi...
research
07/01/2003

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Opponents

A satisfactory multiagent learning algorithm should, at a minimum, lear...
research
05/30/2019

An Efficient Detection of Malware by Naive Bayes Classifier Using GPGPU

Due to continuous increase in the number of malware (according to AV-Tes...
research
06/30/2020

R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games

This paper presents a recursive reasoning formalism of Bayesian optimiza...
research
05/26/2023

A Hierarchical Approach to Population Training for Human-AI Collaboration

A major challenge for deep reinforcement learning (DRL) agents is to col...
research
05/31/2022

Simplex NeuPL: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games

Learning to play optimally against any mixture over a diverse set of str...

Please sign up or login with your details

Forgot password? Click here to reset