Nonparametric Regression with Comparisons: Escaping the Curse of Dimensionality with Ordinal Information

06/08/2018
by Yichong Xu, et al.

In supervised learning, we leverage a labeled dataset to design methods for function estimation. In many practical situations, we are able to obtain alternative feedback, possibly at a low cost. A broad goal is to understand the usefulness of this alternative feedback and to design algorithms that exploit it. We focus on a semi-supervised setting where we obtain additional ordinal (or comparison) information for potentially unlabeled samples. We consider ordinal feedback of varying quality, where we have either a perfect ordering of the samples, a noisy ordering of the samples, or noisy pairwise comparisons between the samples. We provide a precise quantification of the usefulness of these types of ordinal feedback in nonparametric regression, showing that in many cases it is possible to accurately estimate an underlying function with a very small labeled set, effectively escaping the curse of dimensionality. We develop an algorithm called Ranking-Regression (RR) and analyze its accuracy as a function of the size of the labeled and unlabeled datasets and various noise parameters. We also present lower bounds that establish fundamental limits for the task and show that RR is optimal in a variety of settings. Finally, we present experiments that show the efficacy of RR and investigate its robustness to various sources of noise and model misspecification.
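The abstract does not spell out the steps of RR, so the following is only an illustrative sketch of how a ranking can stand in for most labels, not the authors' algorithm. It assumes the perfect-ordering case: a least-squares monotone fit (isotonic regression) is run on the few labeled samples in rank order, each unlabeled sample borrows the fitted value of the labeled sample closest to it in the ranking, and a new point is predicted from its nearest neighbor in feature space. The function names `pava`, `fit_ranked_regression`, and `predict` are hypothetical.

```python
import numpy as np


def pava(y):
    """Pool-adjacent-violators: least-squares non-decreasing fit to y."""
    blocks = []  # each block holds [sum of values, number of values]
    for v in y:
        blocks.append([float(v), 1])
        # Merge backwards while the previous block's mean exceeds the last one's.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            s, c = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += c
    return np.concatenate([np.full(c, s / c) for s, c in blocks])


def fit_ranked_regression(X, rank_order, labeled_idx, y_labeled):
    """Impute a value for every sample from a ranking and a few labels.

    X           : (n, d) array of all samples (labeled and unlabeled)
    rank_order  : sample indices sorted from smallest to largest function value
                  (assumed here to be a perfect ordering)
    labeled_idx : indices of the samples that carry a noisy label
    y_labeled   : noisy labels, aligned with labeled_idx
    """
    n = len(X)
    position = np.empty(n, dtype=int)
    position[rank_order] = np.arange(n)  # rank position of each sample

    # Sort the labeled samples by rank and denoise them with a monotone fit.
    lab_pos = position[np.asarray(labeled_idx)]
    order = np.argsort(lab_pos)
    sorted_pos = lab_pos[order]
    fitted = pava(np.asarray(y_labeled, dtype=float)[order])

    # For every sample, find the labeled sample nearest to it in rank.
    right = np.clip(np.searchsorted(sorted_pos, position), 0, len(sorted_pos) - 1)
    left = np.clip(right - 1, 0, len(sorted_pos) - 1)
    use_left = np.abs(position - sorted_pos[left]) <= np.abs(sorted_pos[right] - position)
    nearest = np.where(use_left, left, right)
    return fitted[nearest]


def predict(X_train, imputed, X_new):
    """1-nearest-neighbor prediction in feature space using imputed values."""
    d2 = ((X_new[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    return imputed[d2.argmin(axis=1)]
```

In this sketch only the monotone fit touches the noisy labels, and it is a one-dimensional problem along the ranking regardless of the dimension of `X`. That gives some intuition for why a small labeled set can suffice when good ordinal information is available.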


