Contextual Search for General Hypothesis Classes

03/03/2020
by Allen Liu, et al.

We study a general version of the problem of online learning under binary feedback: there is a hidden function f : X → Y in a certain hypothesis class H. A learner is given adversarially chosen inputs (contexts) x_t ∈ X and is asked to submit a guess y_t ∈ Y for the value f(x_t). Upon guessing, the learner incurs a certain loss L(y_t, f(x_t)) and learns whether y_t ≤ f(x_t) or y_t > f(x_t). The special case where H is the class of linear functions over the unit ball has been studied in a series of papers. We both generalize and improve these results. We provide an O(d^2) regret bound, where d is the covering dimension of the hypothesis class. The algorithms are based on a novel technique which we call Steiner potential, since in the linear case it reduces to controlling the value of the Steiner polynomial of a convex region at various scales. We also show that this new technique provides optimal regret (up to log factors) in the linear case (i.e., the original contextual search problem), improving the previously known bound of O(d^4) to O(d log d). Finally, we extend these results to a noisy feedback model, where in each round the feedback is flipped with a fixed probability p < 1/2.
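To make the feedback protocol concrete, here is a minimal Python sketch of the interaction loop in the simplest linear case, d = 1, where f(x) = theta * x with theta in [-1, 1]. It is not the Steiner-potential algorithm from the paper: the learner is plain bisection on theta, the absolute loss |y_t - f(x_t)| is an assumed choice of L, and the names `feedback`, `run_bisection`, and `flip_prob` are hypothetical. The `flip_prob` parameter only illustrates the noisy feedback model; bisection itself is not robust to flipped bits, so the example runs with no noise.

```python
import random


def feedback(y, target, flip_prob=0.0, rng=random):
    """Binary feedback: True if y <= target, flipped with probability flip_prob."""
    bit = y <= target
    if rng.random() < flip_prob:
        bit = not bit
    return bit


def run_bisection(theta, contexts):
    """Play the protocol against hidden f(x) = theta * x (assumed d = 1).

    Returns the total absolute loss and the final interval known to contain theta.
    """
    lo, hi = -1.0, 1.0                      # current knowledge set for theta
    total_loss = 0.0
    for x in contexts:
        mid = (lo + hi) / 2
        y = mid * x                         # guess for f(x)
        target = theta * x                  # hidden value, used only by the environment
        total_loss += abs(y - target)       # assumed loss L(y, f(x)) = |y - f(x)|
        if x == 0:
            continue                        # this context reveals nothing about theta
        below = feedback(y, target)         # noiseless feedback in this sketch
        # y <= f(x) means mid <= theta when x > 0, and mid >= theta when x < 0.
        if below == (x > 0):
            lo = mid
        else:
            hi = mid
    return total_loss, (lo, hi)


loss, (lo, hi) = run_bisection(0.3, [1.0, -0.5, 0.25, -1.0] * 10)
print(loss, lo, hi)
```

In this one-dimensional sketch every informative round halves the interval, so the total loss stays bounded however many rounds are played; the difficulty the abstract addresses is obtaining comparable guarantees in d dimensions and for general hypothesis classes.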

research
05/23/2016

Online Learning with Feedback Graphs Without the Graphs

We study an online learning framework introduced by Mannor and Shamir (2...
research
06/15/2022

Corruption-Robust Contextual Search through Density Updates

We study the problem of contextual search in the adversarial noise model...
research
01/29/2021

Sequential prediction under log-loss and misspecification

We consider the question of sequential prediction under the log-loss in ...
research
09/07/2021

Learning to Bid in Contextual First Price Auctions

In this paper, we investigate the problem about how to bid in repeated c...
research
04/09/2018

Contextual Search via Intrinsic Volumes

We study the problem of contextual search, a multidimensional generaliza...
research
06/25/2021

Littlestone Classes are Privately Online Learnable

We consider the problem of online classification under a privacy constra...
research
02/20/2015

Contextual Semibandits via Supervised Learning Oracles

We study an online decision making problem where on each round a learner...
