Algorithm Selection as a Bandit Problem with Unbounded Losses

07/09/2008
by   Matteo Gagliolo, et al.
0

Algorithm selection is typically based on models of algorithm performance, learned during a separate offline training sequence, which can be prohibitively expensive. In recent work, we adopted an online approach, in which a performance model is iteratively updated and used to guide selection on a sequence of problem instances. The resulting exploration-exploitation trade-off was represented as a bandit problem with expert advice, using an existing solver for this game, but this required the setting of an arbitrary bound on algorithm runtimes, thus invalidating the optimal regret of the solver. In this paper, we propose a simpler framework for representing algorithm selection as a bandit problem, with partial information, and an unknown bound on losses. We adapt an existing solver to this game, proving a bound on its expected regret, which holds also for the resulting algorithm selection technique. We present preliminary experiments with a set of SAT solvers on a mixed SAT-UNSAT benchmark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2022

Second Order Regret Bounds Against Generalized Expert Sequences under Partial Bandit Feedback

We study the problem of expert advice under partial bandit feedback sett...
research
12/07/2020

Online Model Selection: a Rested Bandit Formulation

Motivated by a natural problem in online model selection with bandit inf...
research
07/01/2011

Simple Algorithm Portfolio for SAT

The importance of algorithm portfolio techniques for SAT has long been n...
research
09/09/2020

A Generalized Online Algorithm for Translation and Scale Invariant Prediction with Expert Advice

In this work, we aim to create a completely online algorithmic framework...
research
03/12/2023

Data Dependent Regret Guarantees Against General Comparators for Full or Bandit Feedback

We study the adversarial online learning problem and create a completely...
research
07/19/2020

Who Verifies the Verifiers? A Computer-Checked Implementation of the DPLL Algorithm in Dafny

We build a SAT solver implementing the DPLL algorithm in the verificatio...
research
02/18/2014

Concurrent Cube-and-Conquer

Recent work introduced the cube-and-conquer technique to solve hard SAT ...

Please sign up or login with your details

Forgot password? Click here to reset