Learning Stackelberg Equilibria and Applications to Economic Design Games

10/07/2022
by   Gianluca Brero, et al.
0

We study the use of reinforcement learning to learn the optimal leader's strategy in Stackelberg games. Learning a leader's strategy has an innate stationarity problem – when optimizing the leader's strategy, the followers' strategies might shift. To circumvent this problem, we model the followers via no-regret dynamics to converge to a Bayesian Coarse-Correlated Equilibrium (B-CCE) of the game induced by the leader. We then embed the followers' no-regret dynamics in the leader's learning environment, which allows us to formulate our learning problem as a standard POMDP. We prove that the optimal policy of this POMDP achieves the same utility as the optimal leader's strategy in our Stackelberg game. We solve this POMDP using actor-critic methods, where the critic is given access to the joint information of all the agents. Finally, we show that our methods are able to learn optimal leader strategies in a variety of settings of increasing complexity, including indirect mechanisms where the leader's strategy is setting up the mechanism's rules.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2021

Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

The hierarchical interaction between the actor and critic in actor-criti...
research
02/10/2022

No-Regret Learning in Dynamic Stackelberg Games

In a Stackelberg game, a leader commits to a randomized strategy, and a ...
research
03/07/2019

Imitative Follower Deception in Stackelberg Games

Uncertainty is one of the major challenges facing applications of game t...
research
08/08/2022

Peer Prediction for Learning Agents

Peer prediction refers to a collection of mechanisms for eliciting infor...
research
12/22/2022

Commitment with Signaling under Double-sided Information Asymmetry

Information asymmetry in games enables players with the information adva...
research
05/28/2019

Robust Commitments and Partial Reputation

Agents rarely act in isolation -- their behavioral history, in particula...
research
08/15/2023

Active Inverse Learning in Stackelberg Trajectory Games

Game-theoretic inverse learning is the problem of inferring the players'...

Please sign up or login with your details

Forgot password? Click here to reset