Adversarial Top-K Ranking

02/15/2016
by   Changho Suh, et al.
0

We study the top-K ranking problem where the goal is to recover the set of top-K ranked items out of a large collection of items based on partially revealed preferences. We consider an adversarial crowdsourced setting where there are two population sets, and pairwise comparison samples drawn from one of the populations follow the standard Bradley-Terry-Luce model (i.e., the chance of item i beating item j is proportional to the relative score of item i to item j), while in the other population, the corresponding chance is inversely proportional to the relative score. When the relative size of the two populations is known, we characterize the minimax limit on the sample size required (up to a constant) for reliably identifying the top-K items, and demonstrate how it scales with the relative size. Moreover, by leveraging a tensor decomposition method for disambiguating mixture distributions, we extend our result to the more realistic scenario in which the relative population size is unknown, thus establishing an upper bound on the fundamental limit of the sample size for recovering the top-K set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/14/2016

Top-K Ranking from Pairwise Comparisons: When Spectral Ranking is Optimal

We explore the top-K rank aggregation problem. Suppose a collection of i...
research
02/01/2019

Graph Resistance and Learning from Pairwise Comparisons

We consider the problem of learning the qualities of a collection of ite...
research
04/23/2021

Heterogeneous item populations across individuals: Consequences for the factor model, item inter-correlations, and scale validity

The paper is devoted to the consequences of blind random selection of it...
research
03/16/2020

Neighborhood-based Pooling for Population-level Label Distribution Learning

Supervised machine learning often requires human-annotated data. While a...
research
05/22/2021

Exact PPS Sampling with Bounded Sample Size

Probability proportional to size (PPS) sampling schemes with a target sa...
research
01/01/2023

Semidefinite programming on population clustering: a global analysis

In this paper, we consider the problem of partitioning a small data samp...

Please sign up or login with your details

Forgot password? Click here to reset