Dimension Reduction in Contextual Online Learning via Nonparametric Variable Selection

09/17/2020
by   Wenhao Li, et al.
0

We consider a contextual online learning (multi-armed bandit) problem with high-dimensional covariate 𝐱 and decision 𝐲. The reward function to learn, f(𝐱,𝐲), does not have a particular parametric form. The literature has shown that the optimal regret is Õ(T^(d_x+d_y+1)/(d_x+d_y+2)), where d_x and d_y are the dimensions of 𝐱 and 𝐲, and thus it suffers from the curse of dimensionality. In many applications, only a small subset of variables in the covariate affect the value of f, which is referred to as sparsity in statistics. To take advantage of the sparsity structure of the covariate, we propose a variable selection algorithm called BV-LASSO, which incorporates novel ideas such as binning and voting to apply LASSO to nonparametric settings. Our algorithm achieves the regret Õ(T^(d_x^*+d_y+1)/(d_x^*+d_y+2)), where d_x^* is the effective covariate dimension. The regret matches the optimal regret when the covariate is d^*_x-dimensional and thus cannot be improved. Our algorithm may serve as a general recipe to achieve dimension reduction via variable selection in nonparametric settings.

READ FULL TEXT
research
12/07/2018

Online Learning and Decision-Making under Generalized Linear Model with High-Dimensional Data

We propose a minimax concave penalized multi-armed bandit algorithm unde...
research
05/03/2018

Nonparametric Learning and Optimization with Covariates

Modern decision analytics frequently involves the optimization of an obj...
research
07/15/2019

A Dimension-free Algorithm for Contextual Continuum-armed Bandits

In contextual continuum-armed bandits, the contexts x and the arms y are...
research
07/01/2020

Variable Selection via Thompson Sampling

Thompson sampling is a heuristic algorithm for the multi-armed bandit pr...
research
10/19/2021

Operator-induced structural variable selection for identifying materials genes

In the emerging field of materials informatics, a fundamental task is to...
research
07/16/2020

Sparsity-Agnostic Lasso Bandit

We consider a stochastic contextual bandit problem where the dimension d...
research
05/30/2023

Cooperative Thresholded Lasso for Sparse Linear Bandit

We present a novel approach to address the multi-agent sparse contextual...

Please sign up or login with your details

Forgot password? Click here to reset