Nearest Neighbour with Bandit Feedback

06/23/2023
by Stephen Pasteris, et al.

In this paper we adapt the nearest neighbour rule to the contextual bandit problem. Our algorithm handles the fully adversarial setting, in which no assumptions whatsoever are made about the data-generation process. When combined with a sufficiently fast data structure for (perhaps approximate) adaptive nearest neighbour search, such as a navigating net, our algorithm is extremely efficient: its per-trial running time is polylogarithmic in both the number of trials and the number of actions, and it requires only quasi-linear space.
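To make the setting concrete, the following is a minimal toy sketch of a nearest-neighbour rule under bandit feedback, where only the reward of the chosen action is observed on each trial. It is not the algorithm of the paper: the class name, the epsilon-greedy exploration, and the brute-force Euclidean search are all assumptions made for illustration. The brute-force search costs time linear in the number of past trials, which is exactly the cost that a structure such as a navigating net is meant to avoid.

```python
import math
import random


class NaiveNearestNeighbourBandit:
    """Toy epsilon-greedy nearest-neighbour rule (hypothetical, not the paper's algorithm)."""

    def __init__(self, num_actions, epsilon=0.1, seed=0):
        self.num_actions = num_actions
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # history[a] holds (context, reward) pairs for trials where action a was played.
        self.history = [[] for _ in range(num_actions)]

    def _estimate(self, action, context):
        """Reward seen at the stored context nearest to `context` for `action`."""
        past = self.history[action]
        if not past:
            # Untried actions look optimistic, so they are tried at least once.
            return math.inf
        # Brute-force search: linear in the number of stored trials for this action.
        _, reward = min(past, key=lambda pair: math.dist(pair[0], context))
        return reward

    def select(self, context):
        """Choose an action for `context` (a sequence of floats)."""
        context = tuple(context)
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.num_actions)  # explore uniformly
        estimates = [self._estimate(a, context) for a in range(self.num_actions)]
        return max(range(self.num_actions), key=estimates.__getitem__)

    def update(self, context, action, reward):
        """Record bandit feedback: only the chosen action's reward is revealed."""
        self.history[action].append((tuple(context), reward))
```

On each trial the learner would call `select` with the current context, play the returned action, and pass the observed reward to `update`. This sketch carries no regret guarantee in the adversarial setting; it only illustrates the interaction protocol.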


