Variable Selection via Thompson Sampling

07/01/2020
by Yi Liu, et al.

Thompson sampling is a heuristic algorithm for the multi-armed bandit problem that has a long tradition in machine learning. The algorithm has a Bayesian spirit in the sense that it selects arms based on posterior samples of the reward probabilities of each arm. By forging a connection between combinatorial binary bandits and spike-and-slab variable selection, we propose a stochastic optimization approach to subset selection called Thompson Variable Selection (TVS). TVS is a framework for interpretable machine learning that does not require the underlying model to be linear. TVS brings together Bayesian reinforcement and machine learning in order to extend the reach of Bayesian subset selection to non-parametric models and large datasets with very many predictors and/or very many observations. Depending on the choice of reward, TVS can be deployed in offline as well as online setups with streaming data batches. Tailoring multiplay bandits to variable selection, we provide regret bounds without necessarily assuming that the arm mean rewards are unrelated. We show very strong empirical performance on both simulated and real data. Unlike deterministic optimization methods for spike-and-slab variable selection, the stochastic nature of TVS makes it less prone to convergence to local optima and thereby more robust.
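
To make the bandit view of variable selection concrete, the following is a minimal sketch of Beta-Bernoulli Thompson sampling in which each candidate variable is treated as an arm and several arms may be played per round. It is not the authors' TVS implementation. The function name `thompson_subset_selection`, the 0.5 inclusion threshold, the uniform Beta(1, 1) priors, and the toy reward oracle (reward 1 with probability 0.9 for a truly relevant variable and 0.1 otherwise) are all assumptions made for illustration; the abstract only says that the reward is a modeling choice.

```python
import random


def thompson_subset_selection(n_vars, relevant, rounds=2000, seed=0):
    """Toy multi-play Thompson sampling over binary 'include variable' arms.

    n_vars   -- number of candidate variables (arms)
    relevant -- set of indices that the hypothetical oracle treats as signal
    Returns the posterior mean inclusion probability of each variable.
    """
    rng = random.Random(seed)
    # Beta(1, 1) prior on each arm's reward probability (an assumption).
    a = [1.0] * n_vars
    b = [1.0] * n_vars

    for _ in range(rounds):
        # Draw one posterior sample of the reward probability per arm.
        theta = [rng.betavariate(a[i], b[i]) for i in range(n_vars)]
        # Play every arm whose sampled probability exceeds 0.5
        # (illustrative selection rule, not taken from the source).
        subset = [i for i in range(n_vars) if theta[i] > 0.5]

        for i in subset:
            # Hypothetical reward oracle standing in for a model-based reward.
            p_reward = 0.9 if i in relevant else 0.1
            reward = 1 if rng.random() < p_reward else 0
            # Conjugate Beta-Bernoulli update for the played arm only.
            a[i] += reward
            b[i] += 1 - reward

    return [a[i] / (a[i] + b[i]) for i in range(n_vars)]


if __name__ == "__main__":
    means = thompson_subset_selection(n_vars=6, relevant={0, 1})
    for i, m in enumerate(means):
        print(f"variable {i}: posterior inclusion mean {m:.2f}")
```

In this sketch, variables that rarely earn a reward see their sampled probabilities fall below the threshold and are played less and less often, while rewarded variables keep being selected. In a real application the toy oracle would be replaced by a reward computed from a fitted model on the selected subset.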

research
12/18/2018

Comparing Spike and Slab Priors for Bayesian Variable Selection

An important task in building regression models is to decide which regre...
research
05/22/2021

Combinatorial Blocking Bandits with Stochastic Delays

Recent work has considered natural variations of the multi-armed bandit ...
research
08/08/2018

Nonparametric Gaussian mixture models for the multi-armed contextual bandit

The multi-armed bandit is a sequential allocation task where an agent mu...
research
09/17/2020

Dimension Reduction in Contextual Online Learning via Nonparametric Variable Selection

We consider a contextual online learning (multi-armed bandit) problem wi...
research
06/06/2018

ABC Variable Selection with Bayesian Forests

Few problems in statistics are as perplexing as variable selection in th...
research
03/22/2023

Scalable Bayesian bi-level variable selection in generalized linear models

Motivated by a real-world application in cardiology, we develop an algor...
research
10/15/2017

Variable selection for (realistic) stochastic blockmodels

Stochastic blockmodels provide a convenient representation of relations ...
