Adaptive Mechanism Design: Learning to Promote Cooperation

06/11/2018
by   Tobias Baumann, et al.
0

In the future, artificial learning agents are likely to become increasingly widespread in our society. They will interact with both other learning agents and humans in a variety of complex settings including social dilemmas. We consider the problem of how an external agent can promote cooperation between artificial learners by distributing additional rewards and punishments based on observing the learners' actions. We propose a rule for automatically learning how to create right incentives by considering the players' anticipated parameter updates. Using this learning rule leads to cooperation with high social welfare in matrix games in which the agents would otherwise learn to defect with high probability. We show that the resulting cooperative outcome is stable in certain games even if the planning agent is turned off after a given number of episodes, while other games require ongoing intervention to maintain mutual cooperation. However, even in the latter case, the amount of necessary additional incentives decreases over time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2022

Cooperative Artificial Intelligence

In the future, artificial learning agents are likely to become increasin...
research
11/15/2018

Cooperation Enforcement and Collusion Resistance in Repeated Public Goods Games

Enforcing cooperation among substantial agents is one of the main object...
research
06/21/2020

Emergent cooperation through mutual information maximization

With artificial intelligence systems becoming ubiquitous in our society,...
research
01/22/2020

Signalling Acts of Punishment Promotes the Emergence of Cooperation and Enhanced Social Welfare in Evolutionary Games

Social punishment has been suggested as a key approach to ensuring high ...
research
01/13/2022

Nanowars can cause epidemic resurgence and fail to promote cooperation

In a non-sustainable, "over-populated" world, what might the use of nano...
research
11/24/2022

On the Emergence of Cooperation in the Repeated Prisoner's Dilemma

Using simulations between pairs of ϵ-greedy q-learners with one-period m...
research
02/21/2022

The Good Shepherd: An Oracle Agent for Mechanism Design

From social networks to traffic routing, artificial learning agents are ...

Please sign up or login with your details

Forgot password? Click here to reset