Balanced Linear Contextual Bandits

12/15/2018
by   Maria Dimakopoulou, et al.
0

Contextual bandit algorithms are sensitive to the estimation method of the outcome model as well as the exploration method used, particularly in the presence of rich heterogeneity or complex outcome models, which can lead to difficult estimation problems along the path of learning. We develop algorithms for contextual bandits with linear payoffs that integrate balancing methods from the causal inference literature in their estimation to make it less prone to problems of estimation bias. We provide the first regret bound analyses for linear contextual bandits with balancing and show that our algorithms match the state of the art theoretical guarantees. We demonstrate the strong practical advantage of balanced contextual bandits on a large number of supervised learning datasets and on a synthetic example that simulates model misspecification and prejudice in the initial training data.

READ FULL TEXT
research
11/19/2017

Estimation Considerations in Contextual Bandits

Contextual bandit algorithms seek to learn a personalized treatment assi...
research
11/09/2017

Action Centered Contextual Bandits

Contextual bandits have become popular as they offer a middle ground bet...
research
02/20/2019

AdaLinUCB: Opportunistic Learning for Contextual Bandits

In this paper, we propose and study opportunistic contextual bandits - a...
research
06/09/2022

Conformal Off-Policy Prediction in Contextual Bandits

Most off-policy evaluation methods for contextual bandits have focused o...
research
02/21/2020

Online Learning in Contextual Bandits using Gated Linear Networks

We introduce a new and completely online contextual bandit algorithm cal...
research
05/06/2021

Contextual Bandits with Sparse Data in Web setting

This paper is a scoping study to identify current methods used in handli...
research
05/30/2019

Rarely-switching linear bandits: optimization of causal effects for the real world

Exploring the effect of policies in many real world scenarios is difficu...

Please sign up or login with your details

Forgot password? Click here to reset