Multi-agent Bayesian Learning with Adaptive Strategies: Convergence and Stability

10/18/2020
by   Manxi Wu, et al.

We study learning dynamics induced by strategic agents who repeatedly play a game with an unknown payoff-relevant parameter. In each step, an information system uses Bayes' rule to update a belief distribution over the parameter based on the players' strategies and realized payoffs. Players then adjust their strategies by taking into account an equilibrium strategy or a best response strategy computed under the updated belief. We prove that beliefs and strategies converge to a fixed point with probability 1, and we provide conditions that guarantee local and global stability of fixed points. Any fixed point belief consistently estimates the payoff distribution given the fixed point strategy profile. However, convergence to a complete information Nash equilibrium is not always guaranteed. We provide a necessary and sufficient condition under which the fixed point belief recovers the unknown parameter. We also provide a sufficient condition for convergence to a complete information equilibrium even when parameter learning is incomplete.
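The coupled belief and strategy updates described above can be illustrated with a small simulation. This is only a minimal sketch under assumptions that are not taken from the paper: a hypothetical two-player Cournot-style game in which the unknown parameter is a demand intercept, a finite set of candidate parameter values, Gaussian noise on the observed price, and a fixed step size for the strategy update. All function and variable names (`bayes_update`, `best_response`, `simulate`, `step_size`, and so on) are illustrative.

```python
import numpy as np

def bayes_update(belief, thetas, total_q, price, sigma):
    """One Bayes step over a finite set of candidate parameters (assumed model)."""
    # Assumed observation model: price = theta - total_q + Gaussian noise.
    resid = price - (thetas - total_q)
    # Work in log space; the floor avoids log(0) if a belief entry underflows.
    log_post = np.log(np.maximum(belief, 1e-300)) - 0.5 * (resid / sigma) ** 2
    log_post -= log_post.max()
    post = np.exp(log_post)
    return post / post.sum()

def best_response(q_other, belief, thetas):
    """Best response to the opponent's quantity under the current belief.

    Assumed payoff: u_i = q_i * (theta - q_i - q_other), so the maximizer
    of the expected payoff is (E[theta] - q_other) / 2, clipped at zero.
    """
    mean_theta = float(belief @ thetas)
    return max(0.0, (mean_theta - q_other) / 2.0)

def simulate(true_theta=2.0, thetas=(1.0, 2.0, 3.0), steps=200,
             step_size=0.5, sigma=0.5, seed=0):
    rng = np.random.default_rng(seed)
    thetas = np.asarray(thetas, dtype=float)
    belief = np.full(len(thetas), 1.0 / len(thetas))  # uniform prior
    q = np.array([0.1, 0.1])                          # initial strategies
    for _ in range(steps):
        # Realized outcome under the true (unknown to players) parameter.
        price = true_theta - q.sum() + sigma * rng.normal()
        # Belief update from the observed strategies and outcome.
        belief = bayes_update(belief, thetas, q.sum(), price, sigma)
        # Strategy update: move part of the way toward the best response.
        br = np.array([best_response(q[1], belief, thetas),
                       best_response(q[0], belief, thetas)])
        q = (1.0 - step_size) * q + step_size * br
    return belief, q

if __name__ == "__main__":
    belief, q = simulate()
    print("belief:", belief)
    print("strategies:", q)
```

In this toy model the observed price distinguishes every candidate parameter at any strategy profile, so the belief concentrates on the true value and the strategies approach the complete information equilibrium. It therefore does not exhibit the incomplete-learning fixed points discussed in the abstract, which require games where different parameters induce the same payoff distribution at the fixed point strategy.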

research
09/02/2021

Multi-agent Bayesian Learning with Best Response Dynamics: Convergence and Stability

We study learning dynamics induced by strategic agents who repeatedly pl...
research
06/12/2022

Convergence and Stability of Coupled Belief–Strategy Learning Dynamics in Continuous Games

We propose a learning dynamics to model how strategic agents repeatedly ...
research
07/19/2012

Local stability of Belief Propagation algorithm with multiple fixed points

A number of problems in statistical physics and computer science can be ...
research
05/11/2019

Learning an Unknown Network State in Routing Games

We study learning dynamics induced by myopic travelers who repeatedly pl...
research
02/17/2010

Graph Zeta Function in the Bethe Free Energy and Loopy Belief Propagation

We propose a new approach to the analysis of Loopy Belief Propagation (L...
research
10/17/2021

Dynamic Tolling for Inducing Socially Optimal Traffic Loads

How to design tolls that induce socially optimal traffic loads with dyna...
research
01/28/2021

Equilibrium Learning in Combinatorial Auctions: Computing Approximate Bayesian Nash Equilibria via Pseudogradient Dynamics

Applications of combinatorial auctions (CA) as market mechanisms are pre...
