Consistency of Ranking Estimators

09/02/2019
by Toby Kenney, et al.

The ranking problem is to order a collection of units by some unobserved parameter, based on observations from the associated distribution. It arises naturally in many contexts: in business, where we may want to rank potential projects by profitability, and in science, where we may want to rank predictors potentially associated with some trait by the strength of the association. The ranking framework provides a valuable alternative to the sparsity framework often used with big data. Most approaches to the problem are empirical Bayesian: the data are used to estimate the hyperparameters of the prior distribution, and that distribution is then used to estimate the unobserved parameter values. The approaches differ mainly in the loss function used to penalize mis-ranked units. Despite the number of papers developing methods for this problem, there is no work on the consistency of these methods. In this paper, we develop a general framework for consistency of empirical Bayesian ranking methods, which includes nearly all commonly used methods. We then determine conditions under which consistency holds. Given that little work has been done on selection of the prior distribution, and that the loss functions developed are not strongly motivated, we consider the case where both are misspecified. We show that all ranking methods are consistent provided that (i) the loss function is reasonable, (ii) the prior distribution is not too light-tailed, and (iii) the error in measuring each unit converges to zero at a fast enough rate compared with the number of units, which is assumed to increase to infinity.
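The empirical Bayesian recipe described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: a normal prior with unknown mean and variance, normal measurement error with known per-unit variances, method-of-moments hyperparameter estimates, and ranking by posterior mean (only one of many possible loss functions). The function name `eb_rank` is hypothetical.

```python
from statistics import fmean, pvariance

def eb_rank(x, s2):
    """Rank units by posterior mean under an assumed normal-normal model.

    x  : observed value for each unit, x_i ~ N(theta_i, s2_i)
    s2 : known measurement variance for each unit
    Assumed prior (illustrative only): theta_i ~ N(mu, tau2).
    """
    # Step 1: estimate the prior hyperparameters from the data.
    mu_hat = fmean(x)
    # Method of moments: Var(x_i) is roughly tau2 + average noise variance.
    tau2_hat = max(pvariance(x, mu_hat) - fmean(s2), 0.0)

    # Step 2: posterior mean of each theta_i, shrinking x_i toward mu_hat.
    post = []
    for xi, s2i in zip(x, s2):
        total = tau2_hat + s2i
        shrink = tau2_hat / total if total > 0 else 0.0
        post.append(mu_hat + shrink * (xi - mu_hat))

    # Step 3: rank units from largest to smallest posterior mean.
    order = sorted(range(len(x)), key=lambda i: post[i], reverse=True)
    return order, post

# Unit 0 has the largest observation but is measured very noisily,
# so it is shrunk strongly toward the overall mean.
order, post = eb_rank([3.0, 2.5, 0.0, -1.0], [4.0, 0.1, 0.1, 0.1])
print(order)
```

In this toy input the noisily measured unit 0 falls behind unit 1 after shrinkage, which shows how an empirical Bayesian ranking can differ from ranking the raw observations.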
