Non-Cooperative Inverse Reinforcement Learning

11/03/2019
by   Xiangyuan Zhang, et al.
0

Making decisions in the presence of a strategic opponent requires one to take into account the opponent's ability to actively mask its intended objective. To describe such strategic situations, we introduce the non-cooperative inverse reinforcement learning (N-CIRL) formalism. The N-CIRL formalism consists of two agents with completely misaligned objectives, where only one of the agents knows the true objective function. Formally, we model the N-CIRL formalism as a zero-sum Markov game with one-sided incomplete information. Through interacting with the more informed player, the less informed player attempts to both infer, and act according to, the true objective function. As a result of the one-sided incomplete information, the multi-stage game can be decomposed into a sequence of single-stage games expressed by a recursive formula. Solving this recursive formula yields the value of the N-CIRL game and the more informed player's equilibrium strategy. Another recursive formula, constructed by forming an auxiliary game, termed the dual game, yields the less informed player's strategy. Building upon these two recursive formulas, we develop a computationally tractable algorithm to approximately solve for the equilibrium strategies. Finally, we demonstrate the benefits of our N-CIRL formalism over the existing multi-agent IRL formalism via extensive numerical simulation in a novel cyber security setting.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

page 8

page 10

page 11

research
11/02/2021

Information Spillover in Multiple Zero-sum Games

This paper considers an infinitely repeated three-player Bayesian game w...
research
05/26/2020

Periodic Strategies II: Generalizations and Extensions

At a mixed Nash equilibrium, the payoff of a player does not depend on h...
research
08/24/2020

LP Formulations of Two-Player Zero-Sum Stochastic Bayesian games

This paper studies two-player zero-sum stochastic Bayesian games where e...
research
06/26/2018

Multi-agent Inverse Reinforcement Learning for General-sum Stochastic Games

This paper addresses the problem of multi-agent inverse reinforcement le...
research
04/10/2020

Deceptive Labeling: Hypergames on Graphs for Stealthy Deception

With the increasing sophistication of attacks on cyber-physical systems,...
research
09/28/2020

Zero Knowledge Games

Zero-knowledge strategies as a form of inference and reasoning operate u...
research
02/12/2019

Security-Aware Synthesis Using Delayed-Action Games

Stochastic multiplayer games (SMGs) have gained attention in the field o...

Please sign up or login with your details

Forgot password? Click here to reset