Survey Bandits with Regret Guarantees

02/23/2020

∙

We consider a variant of the contextual bandit problem. In standard contextual bandits, when a user arrives we get the user's complete feature vector and then assign a treatment (arm) to that user. In a number of applications (like healthcare), collecting features from users can be costly. To address this issue, we propose algorithms that avoid needless feature collection while maintaining strong regret guarantees.

READ FULL TEXT

Survey Bandits with Regret Guarantees

Sign in with Google

Consider DeepAI Pro