The Impact of Batch Learning in Stochastic Bandits

11/03/2021
by   Danil Provodin, et al.
12

We consider a special case of bandit problems, namely batched bandits. Motivated by natural restrictions of recommender systems and e-commerce platforms, we assume that a learning agent observes responses batched in groups over a certain time period. Unlike previous work, we consider a more practically relevant batch-centric scenario of batch learning. We provide a policy-agnostic regret analysis and demonstrate upper and lower bounds for the regret of a candidate policy. Our main theoretical results show that the impact of batch learning can be measured in terms of online behavior. Finally, we demonstrate the consistency of theoretical results by conducting empirical experiments and reflect on the optimal batch size choice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2022

The Impact of Batch Learning in Stochastic Linear Bandits

We consider a special case of bandit problems, named batched bandits, in...
research
07/04/2020

Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design

Motivated by practical needs such as large-scale learning, we study the ...
research
08/27/2020

Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits

We study the problem of dynamic batch learning in high-dimensional spars...
research
02/25/2021

Batched Neural Bandits

In many sequential decision-making problems, the individuals are split i...
research
09/01/2023

Interactive and Concentrated Differential Privacy for Bandits

Bandits play a crucial role in interactive learning schemes and modern r...
research
06/15/2020

Latent Bandits Revisited

A latent bandit problem is one in which the learning agent knows the arm...
research
07/26/2019

Anonymity Mixes as (Partial) Assembly Queues: Modeling and Analysis

Anonymity platforms route the traffic over a network of special routers ...

Please sign up or login with your details

Forgot password? Click here to reset