Online learning with Corrupted context: Corrupted Contextual Bandits

06/26/2020
by Djallel Bouneffouf, et al.

We consider a novel variant of the contextual bandit problem (i.e., the multi-armed bandit with side information, or context, available to the decision-maker) in which the context used at each decision may be corrupted ("useless context"). This new problem is motivated by certain online settings, including clinical trials and ad recommendation. To address the corrupted-context setting, we propose combining the standard contextual bandit approach with a classical multi-armed bandit mechanism. Unlike standard contextual bandit methods, our approach learns from every iteration, even those with corrupted context, by improving the estimate of each arm's expected reward. Promising empirical results are obtained on several real-life datasets.
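
The combination described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the class `HybridBandit`, the epsilon-greedy exploration, the per-arm linear model trained by gradient steps, and in particular the `context_ok` flag are all assumptions made for illustration. The abstract does not say how corruption is detected, and in practice the learner may not know which contexts are corrupted. The sketch only shows the general idea that a context-free estimate of each arm's expected reward can be updated on every round, while a contextual model is used when the context is usable.

```python
import random


class HybridBandit:
    """Toy mix of a context-free bandit and a per-arm linear contextual model.

    Hypothetical illustration only; not the method from the paper.
    """

    def __init__(self, n_arms, dim, lr=0.1, epsilon=0.1, seed=0):
        self.n_arms = n_arms
        self.lr = lr
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # Context-free statistics, updated on every round.
        self.counts = [0] * n_arms
        self.means = [0.0] * n_arms
        # Per-arm linear weights, updated only when the context is usable.
        self.weights = [[0.0] * dim for _ in range(n_arms)]

    def _predict(self, arm, context):
        return sum(w * x for w, x in zip(self.weights[arm], context))

    def select(self, context, context_ok):
        # Explore uniformly at random with probability epsilon.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_arms)
        if context_ok:
            scores = [self._predict(a, context) for a in range(self.n_arms)]
        else:
            # Assumed fallback: ignore the context and use running means.
            scores = self.means
        return max(range(self.n_arms), key=lambda a: scores[a])

    def update(self, arm, reward, context, context_ok):
        # The running mean learns from every round, corrupted or not.
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]
        if context_ok:
            # One gradient step on the squared prediction error.
            error = reward - self._predict(arm, context)
            for i, x in enumerate(context):
                self.weights[arm][i] += self.lr * error * x
```

In a simulation loop, one would call `select` with the observed context, observe a reward for the chosen arm, and then call `update` with the same arguments plus the reward. Rounds flagged as corrupted still refine `means`, which is the sense in which no iteration is wasted in this sketch.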

research
07/13/2020

Contextual Bandit with Missing Rewards

We consider a novel variant of the contextual bandit problem (i.e., the ...
research
05/10/2017

Context Attentive Bandits: Contextual Bandit with Restricted Context

We consider a novel formulation of the multi-armed bandit model, which w...
research
07/15/2023

On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms

Efficient learning in multi-armed bandit mechanisms such as pay-per-clic...
research
11/16/2020

Corrupted Contextual Bandits with Action Order Constraints

We consider a variant of the novel contextual bandit problem with corrup...
research
11/06/2018

contextual: Evaluating Contextual Multi-Armed Bandit Problems in R

Over the past decade, contextual bandit algorithms have been gaining in ...
research
02/26/2020

Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions

For sponsored search auctions, we consider contextual multi-armed bandit...
research
08/21/2019

Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem

The (contextual) multi-armed bandit problem (MAB) provides a formalizati...
