How to gamble with non-stationary X-armed bandits and have no regrets

08/20/2019

by Valeriy Avanesov, et al.

In the X-armed bandit problem, an agent sequentially interacts with an environment that yields a reward based on the vector input the agent provides. The agent's goal is to maximise the sum of these rewards over some number of time steps. The problem and its variations have been the subject of numerous studies, which propose strategies with sub-linear and sometimes optimal regret. This paper introduces a novel variation of the problem: we consider an environment that can abruptly change its behaviour an unknown number of times. For this setting we propose a novel strategy and prove that it attains sub-linear cumulative regret. Moreover, when the relation between an action and the corresponding reward is highly smooth, the method is nearly optimal. The theoretical results are supported by an experimental study.
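To make the notion of cumulative regret under abrupt changes concrete, here is a minimal Python sketch. It is not the strategy proposed in the paper. The one-dimensional action space [0, 1], the tent-shaped mean reward, the change points, and all function names are assumptions chosen purely for illustration.

```python
import random


def segment_index(t, change_points):
    """Number of change points at or before step t (0 for the first segment)."""
    return sum(t >= c for c in change_points)


def mean_reward(t, x, change_points, optima):
    """Hypothetical mean reward: equals 1.0 at the current segment's optimum
    and decreases linearly with the distance from it."""
    return 1.0 - abs(x - optima[segment_index(t, change_points)])


def cumulative_regret(actions, change_points, optima):
    """Sum over t of (best achievable mean reward) minus (mean reward of x_t)."""
    best = 1.0  # maximum of the toy mean-reward function in every segment
    return sum(
        best - mean_reward(t, x, change_points, optima)
        for t, x in enumerate(actions)
    )


# Assumed toy environment: the optimum jumps at steps 400 and 700.
horizon = 1000
change_points = [400, 700]
optima = [0.2, 0.8, 0.5]  # one optimum per stationary segment

rng = random.Random(0)
random_actions = [rng.random() for _ in range(horizon)]
# An oracle that always knows the current optimum.
oracle_actions = [optima[segment_index(t, change_points)] for t in range(horizon)]
# An agent that found the first optimum but never adapts to the changes.
stale_actions = [optima[0]] * horizon

for name, actions in [
    ("oracle", oracle_actions),
    ("stale", stale_actions),
    ("uniform random", random_actions),
]:
    print(name, round(cumulative_regret(actions, change_points, optima), 2))
```

In this toy setting the non-adaptive agent pays a constant regret at every step after the first change, so its cumulative regret grows linearly in the horizon. A strategy with sub-linear cumulative regret has to detect, or otherwise adapt to, such changes.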

research

01/17/2022

A New Look at Dynamic Regret for Non-Stationary Stochastic Bandits

We study the non-stationary stochastic multi-armed bandit problem, where...

Yasin Abbasi-Yadkori, et al.

research

10/23/2020

Finite Continuum-Armed Bandits

We consider a situation where an agent has T resources to be allocated ...

Solenne Gaucher, et al.

research

05/30/2021

Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits

We consider the non-stationary multi-armed bandit (MAB) framework and pr...

Gourab Ghatak, et al.

research

05/18/2023

Discounted Thompson Sampling for Non-Stationary Bandit Problems

Non-stationary multi-armed bandit (NS-MAB) problems have recently receiv...

Han Qi, et al.

research

11/27/2022

Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints

This paper studies the problem of stochastic continuum-armed bandit with...

Hengquan Guo, et al.

research

06/08/2023

Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards

We study a decentralized multi-agent multi-armed bandit problem in which...

Mengfan Xu, et al.

research

01/12/2020

Collaborative Multi-Agent Multi-Armed Bandit Learning for Small-Cell Caching

This paper investigates learning-based caching in small-cell networks (S...

Xianzhe Xu, et al.