Robust Online Classification: From Estimation to Denoising

09/04/2023
by   Changlong Wu, et al.
0

We study online classification in the presence of noisy labels. The noise mechanism is modeled by a general kernel that specifies, for any feature-label pair, a (known) set of distributions over noisy labels. At each time step, an adversary selects an unknown distribution from the distribution set specified by the kernel based on the actual feature-label pair, and generates the noisy label from the selected distribution. The learner then makes a prediction based on the actual features and noisy labels observed thus far, and incurs loss 1 if the prediction differs from the underlying truth (and 0 otherwise). The prediction quality is quantified through minimax risk, which computes the cumulative loss over a finite horizon T. We show that for a wide range of natural noise kernels, adversarially selected features, and finite class of labeling functions, minimax risk can be upper bounded independent of the time horizon and logarithmic in the size of labeling function class. We then extend these results to inifinite classes and stochastically generated features via the concept of stochastic sequential covering. Our results extend and encompass findings of Ben-David et al. (2009) through substantial generality, and provide intuitive understanding through a novel reduction to online conditional distribution estimation.

READ FULL TEXT
research
07/23/2021

A Realistic Simulation Framework for Learning with Label Noise

We propose a simulation framework for generating realistic instance-depe...
research
12/18/2019

Towards Robust Learning with Different Label Noise Distributions

Noisy labels are an unavoidable consequence of automatic image labeling ...
research
04/19/2019

Online Active Learning: Label Complexity vs. Classification Errors

We study online active learning for classifying streaming instances. At ...
research
03/05/2013

Classification with Asymmetric Label Noise: Consistency and Maximal Denoising

In many real-world classification problems, the labels of training examp...
research
02/14/2019

Classification with unknown class conditional label noise on non-compact feature spaces

We investigate the problem of classification in the presence of unknown ...
research
02/02/2019

Supervised classification via minimax probabilistic transformations

One of the most common and studied problem in machine learning is classi...

Please sign up or login with your details

Forgot password? Click here to reset