Reasoning about Hypothetical Agent Behaviours and their Parameters

06/26/2019
by   Stefano V. Albrecht, et al.
0

Agents can achieve effective interaction with previously unknown other agents by maintaining beliefs over a set of hypothetical behaviours, or types, that these agents may have. A current limitation in this method is that it does not recognise parameters within type specifications, because types are viewed as blackbox mappings from interaction histories to probability distributions over actions. In this work, we propose a general method which allows an agent to reason about both the relative likelihood of types and the values of any bounded continuous parameters within types. The method maintains individual parameter estimates for each type and selectively updates the estimates for some types after each observation. We propose different methods for the selection of types and the estimation of parameter values. The proposed methods are evaluated in detailed experiments, showing that updating the parameter estimates of a single type after each observation can be sufficient to achieve good performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2015

Belief and Truth in Hypothesised Behaviours

There is a long history in game theory on the topic of Bayesian or "rati...
research
01/16/2014

Iterated Belief Change Due to Actions and Observations

In action domains where agents may have erroneous beliefs, reasoning abo...
research
02/08/2023

Policy Evaluation in Decentralized POMDPs with Belief Sharing

Most works on multi-agent reinforcement learning focus on scenarios wher...
research
07/15/2019

On Convergence and Optimality of Best-Response Learning with Policy Types in Multiagent Systems

While many multiagent algorithms are designed for homogeneous systems (i...
research
10/06/2021

Efficient Multi-agent Epistemic Planning: Teaching Planners About Nested Belief

Many AI applications involve the interaction of multiple autonomous agen...
research
12/15/2022

Networks of reinforced stochastic processes: estimation of the probability of asymptotic polarization

In a network of reinforced stochastic processes [arXiv:2206.07514, arXiv...
research
03/21/2020

Crowdsourced Labeling for Worker-Task Specialization Block Model

We consider crowdsourced labeling under a worker-task specialization blo...

Please sign up or login with your details

Forgot password? Click here to reset