Strategic Inference with a Single Private Sample

09/13/2019
by   Erik Miehling, et al.
0

Motivated by applications in cyber security, we develop a simple game model for describing how a learning agent's private information influences an observing agent's inference process. The model describes a situation in which one of the agents (attacker) is deciding which of two targets to attack, one with a known reward and another with uncertain reward. The attacker receives a single private sample from the uncertain target's distribution and updates its belief of the target quality. The other agent (defender) knows the true rewards, but does not see the sample that the attacker has received. This leads to agents possessing asymmetric information: the attacker is uncertain over the parameter of the distribution, whereas the defender is uncertain about the observed sample. After the attacker updates its belief, both the attacker and the defender play a simultaneous move game based on their respective beliefs. We offer a characterization of the pure strategy equilibria of the game and explain how the players' decisions are influenced by their prior knowledge and the payoffs/costs.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
07/15/2023

Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning

Due to the broad range of applications of multi-agent reinforcement lear...
research
01/08/2021

Foureye: Defensive Deception based on Hypergame Theory Against Advanced Persistent Threats

Defensive deception techniques have emerged as a promising proactive def...
research
04/02/2018

Securing Infrastructure Facilities: When does proactive defense help?

Infrastructure systems are increasingly facing new security threats due ...
research
12/02/2021

Reward-Free Attacks in Multi-Agent Reinforcement Learning

We investigate how effective an attacker can be when it only learns from...
research
05/28/2019

Manipulating a Learning Defender and Ways to Counteract

In Stackelberg security games, information about the attacker's type (i....
research
04/12/2023

Optimizing Sensor Allocation against Attackers with Uncertain Intentions: A Worst-Case Regret Minimization Approach

This paper is concerned with the optimal allocation of detection resourc...
research
07/01/2021

Prophet Inequality with Competing Agents

We introduce a model of competing agents in a prophet setting, where rew...

Please sign up or login with your details

Forgot password? Click here to reset