Efficient Linear Bandits through Matrix Sketching

09/28/2018
by   Ilja Kuzborskij, et al.
0

We prove that two popular linear contextual bandit algorithms, OFUL and Thompson Sampling, can be made efficient using Frequent Directions, a deterministic online sketching technique. More precisely, we show that a sketch of size m allows a O(md) update time for both algorithms, as opposed to Ω(d^2) required by their non-sketched versions (where d is the dimension of context vectors). When the selected contexts span a subspace of dimension at most m, we show that this computational speedup is accompanied by an improved regret of order m√(T) for sketched OFUL and of order m√(dT) for sketched Thompson Sampling (ignoring log factors in both cases). Vice versa, when the dimension of the span exceeds m, the regret bounds become of order (1+ε_m)^3/2d√(T) for OFUL and of order ((1+ε_m)d)^3/2√(T) for Thompson Sampling, where ε_m is bounded by the sum of the tail eigenvalues not covered by the sketch. Experiments on real-world datasets corroborate our theoretical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2020

Stochastic Linear Contextual Bandits with Diverse Contexts

In this paper, we investigate the impact of context diversity on stochas...
research
06/11/2022

Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits

We propose a novel algorithm for linear contextual bandits with O(√(dT l...
research
03/16/2023

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

We study linear contextual bandits in the misspecified setting, where th...
research
04/20/2020

Thompson Sampling for Linearly Constrained Bandits

We address multi-armed bandits (MAB) where the objective is to maximize ...
research
05/12/2019

On the Performance of Thompson Sampling on Logistic Bandits

We study the logistic bandit, in which rewards are binary with success p...
research
02/11/2022

Efficient Kernel UCB for Contextual Bandits

In this paper, we tackle the computational efficiency of kernelized UCB ...
research
02/07/2023

Sketchy: Memory-efficient Adaptive Regularization with Frequent Directions

Adaptive regularization methods that exploit more than the diagonal entr...

Please sign up or login with your details

Forgot password? Click here to reset