Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits

02/12/2023
by   Nived Rajaraman, et al.
0

We consider the sequential decision-making problem where the mean outcome is a non-linear function of the chosen action. Compared with the linear model, two curious phenomena arise in non-linear models: first, in addition to the "learning phase" with a standard parametric rate for estimation or regret, there is an "burn-in period" with a fixed cost determined by the non-linear function; second, achieving the smallest burn-in cost requires new exploration algorithms. For a special family of non-linear functions named ridge functions in the literature, we derive upper and lower bounds on the optimal burn-in cost, and in addition, on the entire learning trajectory during the burn-in period via differential equations. In particular, a two-stage algorithm that first finds a good initial action and then treats the problem as locally linear is statistically optimal. In contrast, several classical algorithms, such as UCB and algorithms relying on regression oracles, are provably suboptimal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2018

Semiparametric Contextual Bandits

This paper studies semiparametric contextual bandits, a generalization o...
research
07/06/2023

Optimal Scalarizations for Sublinear Hypervolume Regret

Scalarization is a general technique that can be deployed in any multiob...
research
10/12/2022

Maximum entropy exploration in contextual bandits with neural networks and energy based models

Contextual bandits can solve a huge range of real-world problems. Howeve...
research
11/28/2021

New Development of Homotopy Analysis Method for a Non-linear Integro-Differential Equations with initial conditions

Homotopy analysis method (HAM) was proposed by Liao in 1992 in his PhD t...
research
05/31/2017

The ALAMO approach to machine learning

ALAMO is a computational methodology for leaning algebraic functions fro...
research
03/30/2018

Statistical Non-linear Model, Achievable Rates and Signal Detection for Photon-level Photomultiplier Receiver

We characterize the practical receiver in a wide range of signal intensi...
research
07/08/2019

General non-linear Bellman equations

We consider a general class of non-linear Bellman equations. These open ...

Please sign up or login with your details

Forgot password? Click here to reset