Exploiting Transitivity for Top-k Selection with Score-Based Dueling Bandits

12/31/2020
by   Matthew Groves, et al.
0

We consider the problem of top-k subset selection in Dueling Bandit problems with score information. Real-world pairwise ranking problems often exhibit a high degree of transitivity and prior work has suggested sampling methods that exploit such transitivity through the use of parametric preference models like the Bradley-Terry-Luce (BTL) and Thurstone models. To date, this work has focused on cases where sample outcomes are win/loss binary responses. We extend this to selection problems where sampling results contain quantitative information by proposing a Thurstonian style model and adapting the Pairwise Optimal Computing Budget Allocation for subset selection (POCBAm) sampling method to exploit this model for efficient sample selection. We compare the empirical performance against standard POCBAm and other competing algorithms.

READ FULL TEXT

page 16

page 21

research
10/23/2018

Active Ranking with Subset-wise Preferences

We consider the problem of probably approximately correct (PAC) ranking ...
research
07/15/2022

Selection of the Most Probable Best

We consider an expected-value ranking and selection problem where all k ...
research
03/20/2021

On Subspace Approximation and Subset Selection in Fewer Passes by MCMC Sampling

We consider the problem of subset selection for ℓ_p subspace approximati...
research
02/09/2017

Inductive Pairwise Ranking: Going Beyond the n log(n) Barrier

We study the problem of ranking a set of items from nonactively chosen p...
research
06/07/2023

Fair Column Subset Selection

We consider the problem of fair column subset selection. In particular, ...
research
10/20/2018

Hybrid-MST: A Hybrid Active Sampling Strategy for Pairwise Preference Aggregation

In this paper we present a hybrid active sampling strategy for pairwise ...
research
06/10/2021

Problem-solving benefits of down-sampled lexicase selection

In genetic programming, an evolutionary method for producing computer pr...

Please sign up or login with your details

Forgot password? Click here to reset