Ranking Data with Continuous Labels through Oriented Recursive Partitions

01/17/2018
by   Stephan Clémençon, et al.
0

We formulate a supervised learning problem, referred to as continuous ranking, where a continuous real-valued label Y is assigned to an observable r.v. X taking its values in a feature space X and the goal is to order all possible observations x in X by means of a scoring function s:X→R so that s(X) and Y tend to increase or decrease together with highest probability. This problem generalizes bi/multi-partite ranking to a certain extent and the task of finding optimal scoring functions s(x) can be naturally cast as optimization of a dedicated functional criterion, called the IROC curve here, or as maximization of the Kendall τ related to the pair (s(X), Y ). From the theoretical side, we describe the optimal elements of this problem and provide statistical guarantees for empirical Kendall τ maximization under appropriate conditions for the class of scoring function candidates. We also propose a recursive statistical learning algorithm tailored to empirical IROC curve optimization and producing a piecewise constant scoring function that is fully described by an oriented binary tree. Preliminary numerical experiments highlight the difference in nature between regression and continuous ranking and provide strong empirical evidence of the performance of empirical optimizers of the criteria proposed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2017

Mass Volume Curves and Anomaly Ranking

This paper aims at formulating the issue of ranking multivariate unlabel...
research
02/21/2020

A Multiclass Classification Approach to Label Ranking

In multiclass classification, the goal is to learn how to predict a rand...
research
09/20/2021

Learning to Rank Anomalies: Scalar Performance Criteria and Maximization of Two-Sample Rank Statistics

The ability to collect and store ever more massive databases has been ac...
research
02/05/2015

On Anomaly Ranking and Excess-Mass Curves

Learning how to rank multivariate unlabeled observations depending on th...
research
06/21/2019

On Tree-based Methods for Similarity Learning

In many situations, the choice of an adequate similarity measure or metr...
research
12/18/2013

Functional Bipartite Ranking: a Wavelet-Based Filtering Approach

It is the main goal of this article to address the bipartite ranking iss...
research
01/30/2014

Support vector comparison machines

In ranking problems, the goal is to learn a ranking function from labele...

Please sign up or login with your details

Forgot password? Click here to reset