Achieving Correlated Equilibrium by Studying Opponent's Behavior Through Policy-Based Deep Reinforcement Learning

04/18/2020
by   Kuo Chun Tsai, et al.

Game theory is a profound study of distributed decision-making behavior and has been developed extensively by many scholars. However, many existing works rely on strict assumptions, such as knowledge of the opponent's private behaviors, which might not be practical. In this work, we focus on two Nobel-winning concepts, the Nash equilibrium and the correlated equilibrium. Specifically, our proposed deep reinforcement learning algorithm reaches a correlated equilibrium outside the convex hull of the Nash equilibria. Given the correlated equilibrium probability distribution, we also propose a mathematical model that inverts the calculation of that distribution to estimate the opponent's payoff vector. With those payoffs, deep reinforcement learning learns why and how the rational opponent plays, instead of just learning the regions for corresponding strategies and actions. Through simulations, we show that our proposed method can achieve the optimal correlated equilibrium, which lies outside the convex hull of the Nash equilibria, with limited interaction among players.
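To make the phrase "a correlated equilibrium outside the convex hull of the Nash equilibria" concrete, here is a minimal sketch in Python. It does not use the paper's algorithm or games. It assumes the textbook game of chicken with illustrative payoffs, and the names `PAYOFF`, `is_correlated_equilibrium`, and `expected_payoffs` are hypothetical. It only checks the incentive constraints of a given joint distribution and compares payoffs.

```python
from fractions import Fraction

# Assumed example (not from the paper): the game of chicken.
# Actions: "D" = dare, "C" = chicken out.
ACTIONS = ("D", "C")

# PAYOFF[(row_action, col_action)] = (row payoff, column payoff)
PAYOFF = {
    ("D", "D"): (0, 0),
    ("D", "C"): (7, 2),
    ("C", "D"): (2, 7),
    ("C", "C"): (6, 6),
}


def is_correlated_equilibrium(dist):
    """Return True if no player gains by deviating from a recommendation.

    dist maps joint actions (row_action, col_action) to probabilities.
    """
    for player in (0, 1):
        for rec in ACTIONS:          # action recommended to this player
            for dev in ACTIONS:      # action the player might play instead
                if dev == rec:
                    continue
                gain = Fraction(0)
                for other in ACTIONS:
                    followed = (rec, other) if player == 0 else (other, rec)
                    deviated = (dev, other) if player == 0 else (other, dev)
                    gain += dist.get(followed, 0) * (
                        PAYOFF[deviated][player] - PAYOFF[followed][player]
                    )
                if gain > 0:         # a profitable deviation exists
                    return False
    return True


def expected_payoffs(dist):
    """Expected (row, column) payoffs under a joint distribution."""
    return tuple(sum(p * PAYOFF[a][i] for a, p in dist.items()) for i in (0, 1))


third = Fraction(1, 3)

# Candidate correlated equilibrium: a mediator draws one of three joint
# actions uniformly and privately tells each player only their own action.
ce = {("C", "C"): third, ("D", "C"): third, ("C", "D"): third}

# The three Nash equilibria of this game, written as joint distributions.
nash_equilibria = [
    {("D", "C"): Fraction(1)},
    {("C", "D"): Fraction(1)},
    # Mixed equilibrium: each player dares with probability 1/3.
    {(a, b): (third if a == "D" else 1 - third) * (third if b == "D" else 1 - third)
     for a in ACTIONS for b in ACTIONS},
]

ce_total = sum(expected_payoffs(ce))
best_nash_total = max(sum(expected_payoffs(ne)) for ne in nash_equilibria)

print(is_correlated_equilibrium(ce))   # incentive constraints hold
print(ce_total, best_nash_total)       # total payoff: CE vs. best Nash
```

In this assumed game the correlated distribution gives each player an expected payoff of 5, for a total of 10. Every point in the convex hull of the Nash payoffs has a total of at most 28/3, so the correlated equilibrium payoff lies outside that hull.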


research
12/02/2020

Correlated Equilibria in Wireless Power Control Games

In this paper, we consider the problem of wireless power control in an i...
research
11/20/2020

Continuous Blackjack: Equilibrium, Deviation and Adaptive Strategy

We introduce a variant of the classic poker game blackjack – the continu...
research
06/15/2020

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Finding approximate Nash equilibria in zero-sum imperfect-information ga...
research
01/27/2023

Are Equivariant Equilibrium Approximators Beneficial?

Recently, remarkable progress has been made by approximating Nash equili...
research
07/18/2022

A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games

This paper proposes novel, end-to-end deep reinforcement learning algori...
research
06/13/2012

Learning When to Take Advice: A Statistical Test for Achieving A Correlated Equilibrium

We study a multiagent learning problem where agents can either learn via...
research
04/26/2021

Computational Performance of Deep Reinforcement Learning to find Nash Equilibria

We test the performance of deep deterministic policy gradient (DDPG), a ...
