Intense Competition can Drive Selfish Explorers to Optimize Coverage

05/03/2018
by   Simon Collet, et al.
0

We consider a game-theoretic setting in which selfish individuals compete over resources of varying quality. The motivating example is a group of animals that disperse over patches of food of different abundances. In such scenarios, individuals are biased towards selecting the higher quality patches, while, at the same time, aiming to avoid costly collisions or overlaps. Our goal is to investigate the impact of collision costs on the parallel coverage of resources by the whole group. Consider M sites, where a site x has value f(x). We think of f(x) as the reward associated with site x, and assume that if a single individual visits x exclusively, it receives this exact reward. Typically, we assume that if > 1 individuals visit x then each receives at most f(x). In particular, when competition costs are high, each individual might receive an amount strictly less than f(x), which could even be negative. Conversely, modeling cooperation at a site, we also consider cases where each one gets more than f(x). There are k identical players that compete over the rewards. They independently act in parallel, in a one-shot scenario, each specifying a single site to visit, without knowing which sites are explored by others. The group performance is evaluated by the expected coverage, defined as the sum of f(x) over all sites that are explored by at least one player. Since we assume that players cannot coordinate before choosing their site we focus on symmetric strategies. The main takeaway message of this paper is that the optimal symmetric coverage is expected to emerge when collision costs are relatively high, so that the following "Judgment of Solomon" type of rule holds: If a single player explores a site x then it gains its full reward f(x), but if several players explore it, then neither one receives any reward. Under this policy, it turns out that there exists a unique symmetric Nash Equilibrium strategy, which is, in fact, evolutionary stable. Moreover, this strategy yields the best possible coverage among all symmetric strategies. Viewing the coverage measure as the social welfare, this policy thus enjoys a (Symmetric) Price of Anarchy of precisely 1, whereas, in fact, any other congestion policy has a price strictly greater than 1. Our model falls within the scope of mechanism design, and more precisely in the area of incentivizing exploration. It finds relevance in evolutionary ecology, and further connects to studies on Bayesian parallel search algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2018

Cooperative Search Games: Symmetric Equilibria, Robustness, and Price of Anarchy

Assume that a treasure is placed in one of M boxes according to a known ...
research
08/03/2019

Multiplayer Bandit Learning, from Competition to Cooperation

The stochastic multi-armed bandit problem is a classic model illustratin...
research
10/26/2018

Distributed Multi-Player Bandits - a Game of Thrones Approach

We consider a multi-armed bandit game where N players compete for K arms...
research
02/04/2020

Selfish Robustness and Equilibria in Multi-Player Bandits

Motivated by cognitive radios, stochastic multi-player multi-armed bandi...
research
02/26/2018

Information Provision in a Sequential Search Setting

Consider a variation on the classic Weitzman search problem, in which fi...
research
02/01/2021

Stochastic Alignment Processes

The tendency to align to others is inherent to social behavior, includin...

Please sign up or login with your details

Forgot password? Click here to reset