Similarity-based Cooperation

11/26/2022
by   Caspar Oesterheld, et al.
0

As machine learning agents act more autonomously in the world, they will increasingly interact with each other. Unfortunately, in many social dilemmas like the one-shot Prisoner's Dilemma, standard game theory predicts that ML agents will fail to cooperate with each other. Prior work has shown that one way to enable cooperative outcomes in the one-shot Prisoner's Dilemma is to make the agents mutually transparent to each other, i.e., to allow them to access one another's source code (Rubinstein 1998, Tennenholtz 2004) – or weights in the case of ML agents. However, full transparency is often unrealistic, whereas partial transparency is commonplace. Moreover, it is challenging for agents to learn their way to cooperation in the full transparency setting. In this paper, we introduce a more realistic setting in which agents only observe a single number indicating how similar they are to each other. We prove that this allows for the same set of cooperative outcomes as the full transparency setting. We also demonstrate experimentally that cooperation can be learned using simple ML methods.

READ FULL TEXT

page 16

page 17

research
12/04/2020

Learning in two-player games between transparent opponents

We consider a scenario in which two reinforcement learning agents repeat...
research
12/12/2019

ABOUT ML: Annotation and Benchmarking on Understanding and Transparency of Machine Learning Lifecycles

We present the "Annotation and Benchmarking on Understanding and Transpa...
research
05/09/2019

Transparency in Maintenance of Recruitment Chatbots

We report on experiences with implementing conversational agents in the ...
research
12/23/2021

Should transparency be (in-)transparent? On monitoring aversion and cooperation in teams

Many modern organisations employ methods which involve monitoring of emp...
research
08/15/2022

Cooperative and uncooperative institution designs: Surprises and problems in open-source game theory

It is increasingly possible for real-world agents, such as software-base...
research
07/20/2023

Profit allocation in agricultural supply chains: exploring the nexus of cooperation and compensation

In this paper, we focus on decentralized agricultural supply chains cons...
research
02/03/2021

Improved Cooperation by Exploiting a Common Signal

Can artificial agents benefit from human conventions? Human societies ma...

Please sign up or login with your details

Forgot password? Click here to reset