Learning Implicit Communication Strategies for the Purpose of Illicit Collusion

02/16/2018
by   Aaron Goodman, et al.
0

Winner-take-all dynamics are prevalent throughout the human and natural worlds. Such dynamics promote competition between agents as they secure available resources for themselves and undermine their opponents. However, this competition is mitigated by the capacity of any single agent to amass resources on its own. Thus, agents collude with one another to ensure that one of their own can win and distribute the spoils amongst the co-conspirators. Such collusion can be difficult to execute successfully, and explicit collusion is often prohibited. Thus, competitors may attempt to collude by encoding signals in their publicly observable behavior. Such collusion happens in domains such as economics in which colluding agents establish cartels, in politics via the spoils system, and in biology where symbiotic systems can be more efficient than organisms acting alone. However, it is not known by what mechanisms agents can establish such implicit collusion strategies. In this work, we do so by extending iterated prisoner's dilemma (IPD) into a winner-take-all (WTA) framework, formalize a WTA IPD strategy as a Markov decision process, and use classical and deep reinforcement learning to discover collusion strategies in WTA IPD. We then analyze the techniques that are learned to understand how agents can develop signaling mechanisms over restricted communication channels.

READ FULL TEXT

page 6

page 7

research
04/29/2021

What is Going on Inside Recurrent Meta Reinforcement Learning Agents?

Recurrent meta reinforcement learning (meta-RL) agents are agents that e...
research
10/14/2022

Adaptive patch foraging in deep reinforcement learning agents

Patch foraging is one of the most heavily studied behavioral optimizatio...
research
02/07/2014

Frequency-Based Patrolling with Heterogeneous Agents and Limited Communication

This paper investigates multi-agent frequencybased patrolling of interse...
research
01/22/2021

Theory of Mind for Deep Reinforcement Learning in Hanabi

The partially observable card game Hanabi has recently been proposed as ...
research
05/31/2019

Interval timing in deep reinforcement learning agents

The measurement of time is central to intelligent behavior. We know that...
research
06/15/2020

ForMIC: Foraging via Multiagent RL with Implicit Communication

Multi-agent foraging (MAF) involves distributing a team of agents to sea...
research
04/27/2023

communication of information in systems of heterogenious agents and systems' dynamics

Communication of information in complex systems can be considered as maj...

Please sign up or login with your details

Forgot password? Click here to reset