Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network

02/17/2022
by   Yun Da Tsai, et al.
0

This work addresses the efficiency concern on inferring a nonlinear contextual bandit when the number of arms n is very large. We propose a neural bandit model with an end-to-end training process to efficiently perform bandit algorithms such as Thompson Sampling and UCB during inference. We advance state-of-the-art time complexity to O(log n) with approximate Bayesian inference, neural random feature mapping, approximate global maxima and approximate nearest neighbor search. We further propose a generative adversarial network to shift the bottleneck of maximizing the objective for selecting optimal arms from inference time to training time, enjoying significant speedup with additional advantage of enabling batch and parallel processing. posterior sampling in logarithmic time complexity with the help of approximate nearest neighbor search. Extensive experiments on classification and recommendation tasks demonstrate order-of-magnitude improvement in inference time no significant degradation on the performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/26/2018

An Algorithm for Reducing Approximate Nearest Neighbor to Approximate Near Neighbor with O(logn) Query Time

This paper proposes a new algorithm for reducing Approximate Nearest Nei...
research
06/23/2023

Nearest Neighbour with Bandit Feedback

In this paper we adapt the nearest neighbour rule to the contextual band...
research
03/03/2021

Linear Bandit Algorithms with Sublinear Time Complexity

We propose to accelerate existing linear bandit algorithms to achieve pe...
research
07/25/2018

Local Orthogonal-Group Testing

This work addresses approximate nearest neighbor search applied in the d...
research
01/10/2022

Tree-based Search Graph for Approximate Nearest Neighbor Search

Nearest neighbor search supports important applications in many domains,...
research
01/30/2013

Logarithmic Time Parallel Bayesian Inference

I present a parallel algorithm for exact probabilistic inference in Baye...
research
07/11/2017

Fast Amortized Inference and Learning in Log-linear Models with Randomly Perturbed Nearest Neighbor Search

Inference in log-linear models scales linearly in the size of output spa...

Please sign up or login with your details

Forgot password? Click here to reset