Classification with unknown class conditional label noise on non-compact feature spaces

02/14/2019
by   Henry W J Reeve, et al.
0

We investigate the problem of classification in the presence of unknown class conditional label noise in which the labels observed by the learner have been corrupted with some unknown class dependent probability. In order to obtain finite sample rates, previous approaches to classification with unknown class conditional label noise have required that the regression function attains its extrema uniformly on sets of positive measure. We shall consider this problem in the setting of non-compact metric spaces, where the regression function need not attain its extrema. In this setting we determine the minimax optimal learning rates (up to logarithmic factors). The rate displays interesting threshold behaviour: When the regression function approaches its extrema at a sufficient rate, the optimal learning rates are of the same order as those obtained in the label-noise free setting. If the regression function approaches its extrema more gradually then classification performance necessarily degrades. In addition, we present an algorithm which attains these rates without prior knowledge of either the distributional parameters or the local density. This identifies for the first time a scenario in which finite sample rates are achievable in the label noise setting, but they differ from the optimal rates without label noise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2019

Fast Rates for a kNN Classifier Robust to Unknown Asymmetric Label Noise

We consider classification in the presence of class-dependent asymmetric...
research
11/27/2014

Classification with Noisy Labels by Importance Reweighting

In this paper, we study a classification problem in which sample labels ...
research
09/30/2017

Decontamination of Mutual Contamination Models

Many machine learning problems can be characterized by mutual contaminat...
research
09/02/2021

Optimal subgroup selection

In clinical trials and other applications, we often see regions of the f...
research
07/19/2017

Rates of Uniform Consistency for k-NN Regression

We derive high-probability finite-sample uniform rates of consistency fo...
research
04/16/2023

Regression and Algorithmic Information Theory

In this paper we prove a theorem about regression, in that the shortest ...
research
09/04/2023

Robust Online Classification: From Estimation to Denoising

We study online classification in the presence of noisy labels. The nois...

Please sign up or login with your details

Forgot password? Click here to reset