Team Behavior in Interactive Dynamic Influence Diagrams with Applications to Ad Hoc Teams

Planning for ad hoc teamwork is challenging because it requires agents to collaborate without any prior coordination or communication. The focus is on principled methods by which a single agent can cooperate with others, which motivates investigating the ad hoc teamwork problem in the context of individual decision-making frameworks. However, individual decision making in multiagent settings requires reasoning about other agents' actions, which in turn requires reasoning about how those agents reason about others, leading to an infinite nesting of models. An established approximation that operationalizes this approach bounds the infinite nesting from below by introducing level 0 models. We show that a consequence of this finitely nested modeling is that optimal team solutions may not be obtained in cooperative settings. We address this limitation by including level 0 models whose solutions involve learning. We demonstrate that learning integrated into planning in the context of interactive dynamic influence diagrams facilitates optimal team behavior and is applicable to ad hoc teamwork.


