Feature-Based Q-Learning for Two-Player Stochastic Games

06/02/2019
by   Zeyu Jia, et al.
0

Consider a two-player zero-sum stochastic game where the transition function can be embedded in a given feature space. We propose a two-player Q-learning algorithm for approximating the Nash equilibrium strategy via sampling. The algorithm is shown to find an ϵ-optimal strategy using sample size linear to the number of features. To further improve its sample efficiency, we develop an accelerated algorithm by adopting techniques such as variance reduction, monotonicity preservation and two-sided strategy approximation. We prove that the algorithm is guaranteed to find an ϵ-optimal strategy using no more than Õ(K/(ϵ^2(1-γ)^4)) samples with high probability, where K is the number of features and γ is a discount factor. The sample, time and space complexities of the algorithm are independent of original dimensions of the game.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/29/2019

Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity

In this paper, we settle the sampling complexity of solving discounted t...
research
10/01/2019

Parallel Algorithm for Approximating Nash Equilibrium in Multiplayer Stochastic Games with Application to Naval Strategic Planning

Many real-world domains contain multiple agents behaving strategically w...
research
01/19/2022

Anytime PSRO for Two-Player Zero-Sum Games

Policy space response oracles (PSRO) is a multi-agent reinforcement lear...
research
08/24/2020

LP Formulations of Two-Player Zero-Sum Stochastic Bayesian games

This paper studies two-player zero-sum stochastic Bayesian games where e...
research
06/08/2022

Model-Based Reinforcement Learning Is Minimax-Optimal for Offline Zero-Sum Markov Games

This paper makes progress towards learning Nash equilibria in two-player...
research
03/09/2023

Computational bounds for the 2048 game

2048 is a single player video game, played by millions mostly on mobile ...
research
08/19/2018

Intelligent Middle-Level Game Control

We propose the concept of intelligent middle-level game control, which l...

Please sign up or login with your details

Forgot password? Click here to reset