Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences

05/12/2021
by   Mustafa O. Karabag, et al.
0

In an adversarial environment, a hostile player performing a task may behave like a non-hostile one in order not to reveal its identity to an opponent. To model such a scenario, we define identity concealment games: zero-sum stochastic reachability games with a zero-sum objective of identity concealment. To measure the identity concealment of the player, we introduce the notion of an average player. The average player's policy represents the expected behavior of a non-hostile player. We show that there exists an equilibrium policy pair for every identity concealment game and give the optimality equations to synthesize an equilibrium policy pair. If the player's opponent follows a non-equilibrium policy, the player can hide its identity better. For this reason, we study how the hostile player may learn the opponent's policy. Since learning via exploration policies would quickly reveal the hostile player's identity to the opponent, we consider the problem of learning a near-optimal policy for the hostile player using the game runs collected under the average player's policy. Consequently, we propose an algorithm that provably learns a near-optimal policy and give an upper bound on the number of sample runs to be collected.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2020

Off-Policy Exploitability-Evaluation and Equilibrium-Learning in Two-Player Zero-Sum Markov Games

Off-policy evaluation (OPE) is the problem of evaluating new policies us...
research
02/17/2021

Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games

Policy gradient methods are widely used in solving two-player zero-sum g...
research
11/02/2021

Information Spillover in Multiple Zero-sum Games

This paper considers an infinitely repeated three-player Bayesian game w...
research
08/23/2019

Stackelberg Punishment and Bully-Proofing Autonomous Vehicles

Mutually beneficial behavior in repeated games can be enforced via the t...
research
06/22/2020

Near-Optimal Reinforcement Learning with Self-Play

This paper considers the problem of designing optimal algorithms for rei...
research
02/12/2018

Equilibrium solutions of three player Kuhn poker with N>3 cards: A new numerical method using regularization and arc-length continuation

We study the equilibrium solutions of three player Kuhn poker with N>3 c...
research
11/27/2022

Strategically revealing capabilities in General Lotto games

Can revealing one's competitive capabilities to an opponent offer strate...

Please sign up or login with your details

Forgot password? Click here to reset