A Smoothed Analysis of Online Lasso for the Sparse Linear Contextual Bandit Problem

07/16/2020
by   Zhiyuan Liu, et al.
0

We investigate the sparse linear contextual bandit problem where the parameter θ is sparse. To relieve the sampling inefficiency, we utilize the "perturbed adversary" where the context is generated adversarilly but with small random non-adaptive perturbations. We prove that the simple online Lasso supports sparse linear contextual bandit with regret bound 𝒪(√(kTlog d)) even when d ≫ T where k and d are the number of effective and ambient dimension, respectively. Compared to the recent work from Sivakumar et al. (2020), our analysis does not rely on the precondition processing, adaptive perturbation (the adaptive perturbation violates the i.i.d perturbation setting) or truncation on the error set. Moreover, the special structures in our results explicitly characterize how the perturbation affects exploration length, guide the design of perturbation together with the fundamental performance limit of perturbation method. Numerical experiments are provided to complement the theoretical analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2020

Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection

We consider the stochastic contextual bandit problem under the high dime...
research
02/03/2019

A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free

We propose the first contextual bandit algorithm that is parameter-free,...
research
10/22/2020

Thresholded LASSO Bandit

In this paper, we revisit sparse stochastic contextual linear bandits. I...
research
05/11/2018

Online Bandit Linear Optimization: A Study

This article introduces the concepts around Online Bandit Linear Optimiz...
research
01/10/2018

A Smoothed Analysis of the Greedy Algorithm for the Linear Contextual Bandit Problem

Bandit learning is characterized by the tension between long-term explor...
research
02/21/2019

Certainty Equivalent Control of LQR is Efficient

We study the performance of the certainty equivalent controller on the L...
research
05/01/2019

On the excursion area of perturbed Gaussian fields

We investigate Lipschitz-Killing curvatures for excursion sets of random...

Please sign up or login with your details

Forgot password? Click here to reset