Partial-Information Q-Learning for General Two-Player Stochastic Games

02/21/2023
by Negash Medhin, et al.

In this article we analyze a partial-information Nash Q-learning algorithm for a general two-player stochastic game. Partial information refers to the setting in which a player knows neither the strategy of the opposing player nor the actions that player takes. We prove convergence of this partially informed algorithm for general two-player games with finitely many states and actions, and we confirm that the limiting strategy is in fact a full-information Nash equilibrium. In implementation, partial information offers simplicity because it avoids computing Nash equilibria at every time step. In contrast, full-information Q-learning uses the Lemke-Howson algorithm to compute Nash equilibria at every time step, which can be an effective approach but requires several assumptions to prove convergence and may produce a runtime error if Lemke-Howson encounters degeneracy. In simulations, the partial-information results we obtain are comparable to those for full-information Q-learning and fictitious play.
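To make the partial-information setting concrete, the following is a minimal sketch and not the authors' algorithm: each player runs ordinary tabular Q-learning over its own actions only, never observing the opponent's action or strategy, so no Nash equilibrium is computed at any step. The function names, the toy coordination game, and all hyperparameters (`alpha`, `gamma`, `epsilon`, `n_steps`) are assumptions introduced purely for illustration.

```python
import random

def run_partial_info_q(n_states, n_actions, reward, step, n_steps=5000,
                       alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Illustrative sketch: each player keeps Q[state][own_action] only.

    The opponent's action is never stored or read by a player's update,
    which is the sense of "partial information" assumed here.
    """
    rng = random.Random(seed)
    # q[p][s][a]: player p's estimate for its own action a in state s
    q = [[[0.0] * n_actions for _ in range(n_states)] for _ in range(2)]
    s = 0
    for _ in range(n_steps):
        actions = []
        for p in range(2):
            if rng.random() < epsilon:
                # explore: uniformly random own action
                actions.append(rng.randrange(n_actions))
            else:
                # exploit: greedy with respect to the player's own Q-row
                row = q[p][s]
                actions.append(row.index(max(row)))
        r = reward(s, actions[0], actions[1])        # pair (r1, r2)
        s_next = step(s, actions[0], actions[1], rng)
        for p in range(2):
            # standard Q-learning target using only the player's own table
            target = r[p] + gamma * max(q[p][s_next])
            q[p][s][actions[p]] += alpha * (target - q[p][s][actions[p]])
        s = s_next
    return q

# Hypothetical toy game: both players earn 1 when their actions match.
def toy_reward(s, a1, a2):
    return (1.0, 1.0) if a1 == a2 else (0.0, 0.0)

# Hypothetical dynamics: the next state is drawn uniformly from two states.
def toy_step(s, a1, a2, rng):
    return rng.randrange(2)

q_tables = run_partial_info_q(2, 2, toy_reward, toy_step)
```

A full-information variant would instead index each table by the joint action and solve a stage game at every update, which is the per-step equilibrium computation the abstract says the partial-information approach avoids.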


research
10/26/2020

Computing Nash Equilibria in Multiplayer DAG-Structured Stochastic Games with Persistent Imperfect Information

Many important real-world settings contain multiple players interacting ...
research
08/12/2020

Convergence of Deep Fictitious Play for Stochastic Differential Games

Stochastic differential games have been used extensively to model agents...
research
01/08/2018

A Game Theoretic Approach to Autonomous Two-Player Drone Racing

To be successful in multi-player drone racing, a player must not only fo...
research
07/21/2020

Smoothed Complexity of 2-player Nash Equilibria

We prove that computing a Nash equilibrium of a two-player (n × n) game ...
research
09/30/2014

Non-Myopic Learning in Repeated Stochastic Games

This paper addresses learning in repeated stochastic games (RSGs) played...
research
07/06/2022

Concurrent Games with Multiple Topologies

Concurrent multi-player games with ω-regular objectives are a standard m...
research
01/08/2018

A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing

To be successful in multi-player drone racing, a player must not only fo...
