Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification

06/19/2023
by   Dong Xing, et al.
0

Ad hoc teamwork requires an agent to cooperate with unknown teammates without prior coordination. Many works propose to abstract teammate instances into high-level representation of types and then pre-train the best response for each type. However, most of them do not consider the distribution of teammate instances within a type. This could expose the agent to the hidden risk of type confounding. In the worst case, the best response for an abstract teammate type could be the worst response for all specific instances of that type. This work addresses the issue from the lens of causal inference. We first theoretically demonstrate that this phenomenon is due to the spurious correlation brought by uncontrolled teammate distribution. Then, we propose our solution, CTCAT, which disentangles such correlation through an instance-wise teammate feedback rectification. This operation reweights the interaction of teammate instances within a shared type to reduce the influence of type confounding. The effect of CTCAT is evaluated in multiple domains, including classic ad hoc teamwork tasks and real-world scenarios. Results show that CTCAT is robust to the influence of type confounding, a practical issue that directly hazards the robustness of our trained agents but was unnoticed in previous works.

READ FULL TEXT

page 6

page 13

research
07/28/2022

Towards Robust Ad Hoc Teamwork Agents By Creating Diverse Training Teammates

Ad hoc teamwork (AHT) is the problem of creating an agent that must coll...
research
08/09/2022

Ad Hoc Teamwork in the Presence of Adversaries

Advances in ad hoc teamwork have the potential to create agents that col...
research
06/03/2015

A Game-Theoretic Model and Best-Response Learning Method for Ad Hoc Coordination in Multiagent Systems

The ad hoc coordination problem is to design an autonomous agent which i...
research
09/01/2014

Team Behavior in Interactive Dynamic Influence Diagrams with Applications to Ad Hoc Teams

Planning for ad hoc teamwork is challenging because it involves agents c...
research
08/17/2018

Proving Type Class Laws for Haskell

Type classes in Haskell are used to implement ad-hoc polymorphism, i.e. ...
research
01/10/2022

Assisting Unknown Teammates in Unknown Tasks: Ad Hoc Teamwork under Partial Observability

In this paper, we present a novel Bayesian online prediction algorithm f...
research
02/01/2021

Classifications based on response times for detecting early-stage Alzheimer's disease

Introduction: This paper mainly describes a way to detect with high accu...

Please sign up or login with your details

Forgot password? Click here to reset