Optimal Activation of Halting Multi-Armed Bandit Models

04/20/2023
by   Wesley Cowan, et al.
0

We study new types of dynamic allocation problems the Halting Bandit models. As an application, we obtain new proofs for the classic Gittins index decomposition result and recent results of the authors in `Multi-armed bandits under general depreciation and commitment.'

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2017

Online Multi-Armed Bandit

We introduce a novel variant of the multi-armed bandit problem, in which...
research
03/07/2022

PAC-Bayesian Lifelong Learning For Multi-Armed Bandits

We present a PAC-Bayesian analysis of lifelong learning. In the lifelong...
research
09/11/2019

Practical Calculation of Gittins Indices for Multi-armed Bandits

Gittins indices provide an optimal solution to the classical multi-armed...
research
08/17/2020

Using Subjective Logic to Estimate Uncertainty in Multi-Armed Bandit Problems

The multi-armed bandit problem is a classical decision-making problem wh...
research
06/11/2023

Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering

In this work, we study multi-source test-time model adaptation from user...
research
04/10/2013

Sustainable Cooperative Coevolution with a Multi-Armed Bandit

This paper proposes a self-adaptation mechanism to manage the resources ...
research
06/10/2021

A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits

This paper establishes a central limit theorem under the assumption that...

Please sign up or login with your details

Forgot password? Click here to reset