An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

06/21/2020
by Julian Katz-Samuels, et al.

This paper proposes near-optimal algorithms for the pure-exploration linear bandit problem in the fixed confidence and fixed budget settings. Leveraging ideas from the theory of suprema of empirical processes, we provide an algorithm whose sample complexity scales with the geometry of the instance and avoids an explicit union bound over the number of arms. Unlike previous approaches, which sample by minimizing a worst-case variance (e.g. G-optimal design), we define an experimental design objective based on the Gaussian width of the underlying arm set. We provide a novel lower bound in terms of this objective that highlights its fundamental role in the sample complexity. Our fixed confidence algorithm has a sample complexity that matches this lower bound and is, in addition, computationally efficient for combinatorial classes, e.g. shortest paths, matchings and matroids, where the arm sets can be exponentially large in the dimension. Finally, we propose the first algorithm for linear bandits in the fixed budget setting. Its guarantee matches our lower bound up to logarithmic factors.
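
To illustrate why a Gaussian-width quantity can be much smaller than a union bound over arms, the following minimal sketch estimates the plain Gaussian width E[max_z <g, z>] of a finite arm set by Monte Carlo and compares it with the standard bound max_z ||z|| * sqrt(2 log |Z|). It is not the paper's algorithm or its design objective, which also depends on the sampling design; the arm set, the function names `gaussian_width` and `union_bound`, and all parameters are assumptions chosen for illustration.

```python
import math
import random


def gaussian_width(arms, num_samples=500, seed=0):
    """Monte Carlo estimate of E[max_z <g, z>] with g ~ N(0, I)."""
    rng = random.Random(seed)
    dim = len(arms[0])
    total = 0.0
    for _ in range(num_samples):
        # Draw one standard Gaussian vector and take the best inner product.
        g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        total += max(sum(gi * zi for gi, zi in zip(g, z)) for z in arms)
    return total / num_samples


def union_bound(arms):
    """Standard union-bound style estimate: max_z ||z|| * sqrt(2 log |Z|)."""
    max_norm = max(math.sqrt(sum(zi * zi for zi in z)) for z in arms)
    return max_norm * math.sqrt(2.0 * math.log(len(arms)))


# Hypothetical arm set: 500 nearly identical arms in R^5, clustered around e_1.
rng = random.Random(1)
base = [1.0, 0.0, 0.0, 0.0, 0.0]
arms = [[b + 0.01 * rng.gauss(0.0, 1.0) for b in base] for _ in range(500)]

width = gaussian_width(arms)
bound = union_bound(arms)
print(f"estimated Gaussian width: {width:.3f}")
print(f"union bound over {len(arms)} arms: {bound:.3f}")
```

Because the arms in this toy set are nearly identical, the estimated width stays small while the union bound grows with the logarithm of the number of arms, regardless of how similar the arms are.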


research
02/24/2021

Combinatorial Pure Exploration with Bottleneck Reward Function and its Extension to General Reward Functions

In this paper, we study the Combinatorial Pure Exploration problem with ...
research
11/09/2021

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

We consider interactive learning in the realizable setting and develop a...
research
09/10/2021

Near Instance Optimal Model Selection for Pure Exploration Linear Bandits

The model selection problem in the pure exploration linear bandit settin...
research
06/10/2022

Interactively Learning Preference Constraints in Linear Bandits

We study sequential decision-making with known rewards and unknown const...
research
11/21/2017

Disagreement-based combinatorial pure exploration: Efficient algorithms and an analysis with localization

We design new algorithms for the combinatorial pure exploration problem ...
research
11/02/2021

Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification

We study the problem of the identification of m arms with largest means ...
research
06/03/2023

Asymptotically Optimal Pure Exploration for Infinite-Armed Bandits

We study pure exploration with infinitely many bandit arms generated i.i...
