Ranking Median Regression: Learning to Order through Local Consensus

10/31/2017
by   Stephan Clémençon, et al.
0

This article is devoted to the problem of predicting the value taken by a random permutation Σ, describing the preferences of an individual over a set of numbered items {1, ..., n} say, based on the observation of an input/explanatory r.v. X e.g. characteristics of the individual), when error is measured by the Kendall τ distance. In the probabilistic formulation of the 'Learning to Order' problem we propose, which extends the framework for statistical Kemeny ranking aggregation developped in CKS17, this boils down to recovering conditional Kemeny medians of Σ given X from i.i.d. training examples (X_1, Σ_1), ..., (X_N, Σ_N). For this reason, this statistical learning problem is referred to as ranking median regression here. Our contribution is twofold. We first propose a probabilistic theory of ranking median regression: the set of optimal elements is characterized, the performance of empirical risk minimizers is investigated in this context and situations where fast learning rates can be achieved are also exhibited. Next we introduce the concept of local consensus/median, in order to derive efficient methods for ranking median regression. The major advantage of this local learning approach lies in its close connection with the widely studied Kemeny aggregation problem. From an algorithmic perspective, this permits to build predictive rules for ranking median regression by implementing efficient techniques for (approximate) Kemeny median computations at a local level in a tractable manner. In particular, versions of k-nearest neighbor and tree-based methods, tailored to ranking median regression, are investigated. Accuracy of piecewise constant ranking median regression rules is studied under a specific smoothness assumption for Σ's conditional distribution given X.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2022

Statistical Depth Functions for Ranking Distributions: Definitions, Statistical Learning and Applications

The concept of median/consensus has been widely investigated in order to...
research
02/21/2020

A Multiclass Classification Approach to Label Ranking

In multiclass classification, the goal is to learn how to predict a rand...
research
03/22/2023

Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues

As the issue of robustness in AI systems becomes vital, statistical lear...
research
10/11/2021

Dynamic Median Consensus Over Random Networks

This paper studies the problem of finding the median of N distinct numbe...
research
01/25/2023

Robust non-parametric regression via median-of-means

In this paper, we apply the median-of-means principle to derive robust v...
research
05/04/2020

Exact computation of projection regression depth and fast computation of its induced median and other estimators

Zuo (2019) (Z19) addressed the computation of the projection regression ...
research
06/29/2023

Local Risk Bounds for Statistical Aggregation

In the problem of aggregation, the aim is to combine a given class of ba...

Please sign up or login with your details

Forgot password? Click here to reset