Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations

01/07/2018
by   Xingyu Wang, et al.
0

This paper considers the problem of inverse reinforcement learning in zero-sum stochastic games when expert demonstrations are known to be not optimal. Compared to previous works that decouple agents in the game by assuming optimality in expert strategies, we introduce a new objective function that directly pits experts against Nash Equilibrium strategies, and we design an algorithm to solve for the reward function in the context of inverse reinforcement learning with deep neural networks as model approximations. In our setting the model and algorithm do not decouple by agent. In order to find Nash Equilibrium in large-scale games, we also propose an adversarial training algorithm for zero-sum stochastic games, and show the theoretical appeal of non-existence of local optima in its objective function. In our numerical experiments, we demonstrate that our Nash Equilibrium and inverse reinforcement learning algorithms address games that are not amenable to previous approaches using tabular representations. Moreover, with sub-optimal expert demonstrations our algorithms recover both reward functions and strategies with good quality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2023

On Faking a Nash Equilibrium

We characterize offline data poisoning attacks on Multi-Agent Reinforcem...
research
03/25/2014

Multi-agent Inverse Reinforcement Learning for Zero-sum Games

In this paper we introduce a Bayesian framework for solving a class of p...
research
05/27/2023

Reinforcement Learning With Reward Machines in Stochastic Games

We investigate multi-agent reinforcement learning for stochastic games w...
research
06/26/2018

Multi-agent Inverse Reinforcement Learning for General-sum Stochastic Games

This paper addresses the problem of multi-agent inverse reinforcement le...
research
04/23/2019

Deep Q-Learning for Nash Equilibria: Nash-DQN

Model-free learning for multi-agent stochastic games is an active area o...
research
01/05/2023

Data-Driven Inverse Reinforcement Learning for Expert-Learner Zero-Sum Games

In this paper, we formulate inverse reinforcement learning (IRL) as an e...

Please sign up or login with your details

Forgot password? Click here to reset