Efficient Contextual Bandits with Continuous Actions

06/10/2020
by   Maryam Majzoubi, et al.
0

We create a computationally tractable algorithm for contextual bandits with continuous actions having unknown structure. Our reduction-style algorithm composes with most supervised learning representations. We prove that it works in a general sense and verify the new functionality with large-scale experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2022

Efficient Contextual Bandits with Knapsacks via Regression

We consider contextual bandits with knapsacks (CBwK), a variant of the c...
research
06/13/2023

Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits

We consider policy optimization in contextual bandits, where one is give...
research
02/11/2022

Efficient Kernel UCB for Contextual Bandits

In this paper, we tackle the computational efficiency of kernelized UCB ...
research
07/12/2022

Contextual Bandits with Large Action Spaces: Made Practical

A central problem in sequential decision making is to develop algorithms...
research
02/12/2020

Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles

A fundamental challenge in contextual bandits is to develop flexible, ge...
research
02/07/2021

Bandits for Learning to Explain from Explanations

We introduce Explearn, an online algorithm that learns to jointly output...
research
05/06/2021

Contextual Bandits with Sparse Data in Web setting

This paper is a scoping study to identify current methods used in handli...

Please sign up or login with your details

Forgot password? Click here to reset