Generic Outlier Detection in Multi-Armed Bandit

07/14/2020
by   Yikun Ban, et al.
0

In this paper, we study the problem of outlier arm detection in multi-armed bandit settings, which finds plenty of applications in many high-impact domains such as finance, healthcare, and online advertising. For this problem, a learner aims to identify the arms whose expected rewards deviate significantly from most of the other arms. Different from existing work, we target the generic outlier arms or outlier arm groups whose expected rewards can be larger, smaller, or even in between those of normal arms. To this end, we start by providing a comprehensive definition of such generic outlier arms and outlier arm groups. Then we propose a novel pulling algorithm named GOLD to identify such generic outlier arms. It builds a real-time neighborhood graph based on upper confidence bounds and catches the behavior pattern of outliers from normal arms. We also analyze its performance from various aspects. In the experiments conducted on both synthetic and real-world data sets, the proposed algorithm achieves 98 compared with state-of-the-art techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2012

Multiple Identifications in Multi-Armed Bandits

We study the problem of identifying the top m arms in a multi-armed band...
research
02/22/2018

Regional Multi-Armed Bandits

We consider a variant of the classic multi-armed bandit problem where th...
research
02/11/2020

Online Preselection with Context Information under the Plackett-Luce Model

We consider an extension of the contextual multi-armed bandit problem, i...
research
09/21/2020

Robust Outlier Arm Identification

We study the problem of Robust Outlier Arm Identification (ROAI), where ...
research
05/13/2020

Adaptive Double-Exploration Tradeoff for Outlier Detection

We study a variant of the thresholding bandit problem (TBP) in the conte...
research
04/25/2019

Learning to Detect an Odd Markov Arm

A multi-armed bandit with finitely many arms is studied when each arm is...
research
02/18/2020

Intelligent and Reconfigurable Architecture for KL Divergence Based Online Machine Learning Algorithm

Online machine learning (OML) algorithms do not need any training phase ...

Please sign up or login with your details

Forgot password? Click here to reset