Best-item Learning in Random Utility Models with Subset Choices

02/19/2020
by Aadirupa Saha, et al.

We consider the problem of PAC learning the most valuable item from a pool of n items using sequential, adaptively chosen plays of subsets of k items, where each play of a subset returns relative feedback sampled according to a general Random Utility Model (RUM) with independent noise perturbations to the latent item utilities. We identify a new property of such a RUM, termed the minimum advantage, that helps characterize the complexity of separating pairs of items based on their empirical relative win/loss counts, and that can be bounded as a function of the noise distribution alone. We give a learning algorithm for general RUMs, based on pairwise relative counts of items and hierarchical elimination, along with a new PAC sample complexity guarantee of O((n/(c^2 ϵ^2)) log(k/δ)) rounds to identify an ϵ-optimal item with confidence 1-δ, when the worst-case pairwise advantage in the RUM has sensitivity at least c to the parameter gaps of items. Fundamental lower bounds on PAC sample complexity show that this is near-optimal in terms of its dependence on n, k, and c.
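
To make the setting concrete, the following is a minimal Python sketch of winner feedback from a RUM combined with a simplified hierarchical elimination. It is not the paper's exact procedure. The function names (`play_subset`, `rounds_per_group`, `eliminate`), the default Gaussian noise, and the constant 2 in the per-group budget are all assumptions made for illustration. The budget only mirrors the (1/(c^2 ϵ^2)) log(k/δ) shape of the stated bound, and the sketch keeps the item with the most wins in each group instead of applying thresholded pairwise tests.

```python
import math
import random


def play_subset(utilities, subset, rng, noise=None):
    """One play of a subset under a RUM: each item's latent utility is
    perturbed by independent noise, and the item with the largest
    perturbed utility is returned as the winner."""
    # Assumption: standard Gaussian noise unless another sampler is given.
    noise = noise or (lambda r: r.gauss(0.0, 1.0))
    return max(subset, key=lambda i: utilities[i] + noise(rng))


def rounds_per_group(k, eps, delta, c=1.0):
    """Hypothetical per-group budget with the shape
    (1 / (c^2 eps^2)) * log(k / delta); the constant 2 is illustrative."""
    return math.ceil((2.0 / (c * eps) ** 2) * math.log(k / delta))


def eliminate(utilities, k, eps, delta, rng, c=1.0):
    """Simplified hierarchical elimination: split the surviving items
    into groups of at most k, play each group repeatedly, keep the
    empirical winner of each group, and repeat until one item is left."""
    items = list(range(len(utilities)))
    while len(items) > 1:
        survivors = []
        for start in range(0, len(items), k):
            group = items[start:start + k]
            if len(group) == 1:
                # A leftover singleton advances without being played.
                survivors.append(group[0])
                continue
            wins = {i: 0 for i in group}
            for _ in range(rounds_per_group(k, eps, delta, c)):
                wins[play_subset(utilities, group, rng)] += 1
            # With winner feedback, the item with the most wins also has
            # the best empirical pairwise record, wins[i] / (wins[i] + wins[j]).
            survivors.append(max(group, key=lambda i: wins[i]))
        items = survivors
    return items[0]


if __name__ == "__main__":
    rng = random.Random(0)
    utilities = [0.1, 0.2, 0.0, 3.0, 0.3, 0.15]  # item 3 is the best item
    print(eliminate(utilities, k=3, eps=0.5, delta=0.1, rng=rng))
```

Each of the roughly n/k groups in the first phase uses a budget of order (1/(c^2 ϵ^2)) log(k/δ) plays, and later phases involve geometrically fewer groups. This is one informal way to see where the n/k factor in the total play count of such a scheme comes from. The sketch itself carries no PAC guarantee.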

research
10/23/2018

Active Ranking with Subset-wise Preferences

We consider the problem of probably approximately correct (PAC) ranking ...
research
03/01/2019

From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model

We consider PAC learning for identifying a good item from subset-wise sa...
research
08/12/2018

PAC-Battling Bandits with Plackett-Luce: Tradeoff between Sample Complexity and Subset Size

We introduce the probably approximately correct (PAC) version of the pro...
research
07/13/2023

Nested Elimination: A Simple Algorithm for Best-Item Identification from Choice-Based Feedback

We study the problem of best-item identification from choice-based feedb...
research
06/08/2018

PAC Ranking from Pairwise and Listwise Queries: Lower Bounds and Upper Bounds

This paper explores the adaptively (active) PAC (probably approximately ...
research
02/22/2023

An Interpretable Determinantal Choice Model for Subset Selection

Understanding how subsets of items are chosen from offered sets is criti...
research
05/23/2016

Actively Learning Hemimetrics with Applications to Eliciting User Preferences

Motivated by an application of eliciting users' preferences, we investig...
