Sample Complexity of an Adversarial Attack on UCB-based Best-arm Identification Policy

09/13/2022
by   Varsha Pendyala, et al.
0

In this work I study the problem of adversarial perturbations to rewards, in a Multi-armed bandit (MAB) setting. Specifically, I focus on an adversarial attack to a UCB type best-arm identification policy applied to a stochastic MAB. The UCB attack presented in [1] results in pulling a target arm K very often. I used the attack model of [1] to derive the sample complexity required for selecting target arm K as the best arm. I have proved that the stopping condition of UCB based best-arm identification algorithm given in [2], can be achieved by the target arm K in T rounds, where T depends only on the total number of arms and σ parameter of σ^2- sub-Gaussian random rewards of the arms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2020

Streaming Algorithms for Stochastic Multi-armed Bandits

We study the Stochastic Multi-armed Bandit problem under bounded arm-mem...
research
01/31/2019

A Bad Arm Existence Checking Problem

We study a bad arm existing checking problem in which a player's task is...
research
12/26/2022

Gaussian Process Classification Bandits

Classification bandits are multi-armed bandit problems whose task is to ...
research
09/07/2016

Random Shuffling and Resets for the Non-stationary Stochastic Bandit Problem

We consider a non-stationary formulation of the stochastic multi-armed b...
research
09/01/2023

Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms

This paper considers a stochastic multi-armed bandit (MAB) problem with ...
research
06/06/2021

PAC Best Arm Identification Under a Deadline

We study (ϵ, δ)-PAC best arm identification, where a decision-maker must...
research
10/03/2022

Dealing with Unknown Variances in Best-Arm Identification

The problem of identifying the best arm among a collection of items havi...

Please sign up or login with your details

Forgot password? Click here to reset