On the Linear Ordering Problem and the Rankability of Data

04/12/2021
by Thomas R. Cameron, et al.

In 2019, Anderson et al. proposed the concept of rankability, which refers to a dataset's inherent ability to be meaningfully ranked. In this article, we give an expository review of the linear ordering problem (LOP) and then use it to analyze the rankability of data. Specifically, the degree of linearity is used to quantify what percentage of the data aligns with an optimal ranking. In a sports context, this is analogous to the number of games that a ranking can correctly predict in hindsight. In fact, under the appropriate objective function, we show that the optimal rankings computed via the LOP maximize the hindsight accuracy of a ranking. Moreover, we develop a binary program to compute the maximal Kendall tau ranking distance between two optimal rankings, which can be used to measure the diversity among optimal rankings without having to enumerate all optima. Finally, we provide several examples from the world of sports and college rankings to illustrate these concepts and demonstrate our results.
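To make these quantities concrete, the following is a minimal brute-force sketch, not the authors' method. It assumes a hypothetical three-team win matrix `wins`, where `wins[i][j]` counts the games in which team `i` beat team `j`. It also assumes that the degree of linearity is the optimal LOP objective divided by the sum of all off-diagonal entries. The function names are illustrative only. In particular, the sketch enumerates every ordering to find all optima, whereas the abstract describes a binary program that avoids this enumeration, so it is practical only for very small instances.

```python
from itertools import combinations, permutations


def lop_score(matrix, order):
    """Sum matrix[i][j] over every pair where i is ranked above j."""
    return sum(matrix[i][j] for i, j in combinations(order, 2))


def solve_lop_brute_force(matrix):
    """Return the best LOP score and every ordering that attains it."""
    best_score, optima = None, []
    for order in permutations(range(len(matrix))):
        score = lop_score(matrix, order)
        if best_score is None or score > best_score:
            best_score, optima = score, [order]
        elif score == best_score:
            optima.append(order)
    return best_score, optima


def degree_of_linearity(matrix, best_score):
    """Fraction of the off-diagonal weight that agrees with an optimal ranking
    (assumed definition: optimal objective / total off-diagonal sum)."""
    n = len(matrix)
    total = sum(matrix[i][j] for i in range(n) for j in range(n) if i != j)
    return best_score / total


def kendall_tau_distance(order_a, order_b):
    """Count the pairs that the two rankings place in opposite order."""
    pos_b = {item: k for k, item in enumerate(order_b)}
    return sum(1 for x, y in combinations(order_a, 2) if pos_b[x] > pos_b[y])


# Hypothetical data: a three-team cycle (0 beat 1, 1 beat 2, 2 beat 0).
wins = [
    [0, 1, 0],
    [0, 0, 1],
    [1, 0, 0],
]

best_score, optima = solve_lop_brute_force(wins)
linearity = degree_of_linearity(wins, best_score)
max_distance = max(
    kendall_tau_distance(a, b) for a, b in combinations(optima, 2)
)

print("optimal score:", best_score)
print("optimal rankings:", optima)
print("degree of linearity:", linearity)
print("max Kendall tau distance among optima:", max_distance)
```

In this toy instance every ranking must contradict at least one of the three games, so the best ranking explains two of them in hindsight, and the three tied optima differ from one another in two pairwise comparisons.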

