On Greedy and Strategic Evaders in Sequential Interdiction Settings with Incomplete Information

10/13/2018
by   Sergey S. Ketkov, et al.
0

We consider a class of sequential interdiction settings where the interdictor has incomplete initial information about the network while the evader has complete knowledge of the network, including its structure and arc costs. In each time period, the interdictor can block at most k arcs known to him for the duration of the period. By observing the evader's actions, the interdictor learns about the network structure and costs and thus, can adjust his actions in subsequent time periods. It is known from the literature that if the evader is greedy (i.e., she uses the shortest available path in every time period), then under some assumptions on the the feedback generated from the evader's actions, the greedy interdiction policies that block k-most vital arcs in each time period are efficient and have a finite regret. In this paper, we consider the evader's perspective and explore strategic evasion policies under the assumption that the interdictor is greedy. In particular, we show that the evader's problem is NP-hard even for two time periods. Then we derive constructive properties of optimal evasion policies for two time periods when the interdictor has no initial information about the network structure. Based on these theoretical observations, we design a heuristic algorithm for a strategic evader in a general setting with an arbitrary time horizon and any initial information available to the interdictor initially. Finally, our computational experiments demonstrate that the proposed heuristic consistently outperforms the greedy evasion policy on several classes of synthetic network instances.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2019

Optimal Decision Making Under Strategic Behavior

We are witnessing an increasing use of data-driven predictive models to ...
research
04/26/2017

Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks

We study the stochastic multi-armed bandit (MAB) problem in the presence...
research
02/23/2016

Submodular Learning and Covering with Response-Dependent Costs

We consider interactive learning and covering problems, in a setting whe...
research
01/19/2012

Adaptive Policies for Sequential Sampling under Incomplete Information and a Cost Constraint

We consider the problem of sequential sampling from a finite number of i...
research
11/08/2017

Information Directed Sampling for Stochastic Bandits with Graph Feedback

We consider stochastic multi-armed bandit problems with graph feedback, ...
research
07/21/2023

Complexity of Conformant Election Manipulation

It is important to study how strategic agents can affect the outcome of ...
research
09/09/2011

Integrating Learning from Examples into the Search for Diagnostic Policies

This paper studies the problem of learning diagnostic policies from trai...

Please sign up or login with your details

Forgot password? Click here to reset