Loss factorization, weakly supervised learning and label noise robustness

02/08/2016
by   Giorgio Patrini, et al.
0

We prove that the empirical risk of most well-known loss functions factors into a linear term aggregating all labels with a term that is label free, and can further be expressed by sums of the loss. This holds true even for non-smooth, non-convex losses and in any RKHS. The first term is a (kernel) mean operator --the focal quantity of this work-- which we characterize as the sufficient statistic for the labels. The result tightens known generalization bounds and sheds new light on their interpretation. Factorization has a direct application on weakly supervised learning. In particular, we demonstrate that algorithms like SGD and proximal methods can be adapted with minimal effort to handle weak supervision, once the mean operator has been estimated. We apply this idea to learning with asymmetric noisy labels, connecting and extending prior work. Furthermore, we show that most losses enjoy a data-dependent (by the mean operator) form of noise robustness, in contrast with known negative results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2022

Losses over Labels: Weakly Supervised Learning via Direct Loss Construction

Owing to the prohibitive costs of generating large amounts of labeled da...
research
12/06/2018

Theoretical Guarantees of Deep Embedding Losses Under Label Noise

Collecting labeled data to train deep neural networks is costly and even...
research
03/04/2021

Lower-bounded proper losses for weakly supervised classification

This paper discusses the problem of weakly supervised learning of classi...
research
06/10/2021

Leveraged Weighted Loss for Partial Label Learning

As an important branch of weakly supervised learning, partial label lear...
research
06/11/2021

On the Robustness of Average Losses for Partial-Label Learning

Partial-label (PL) learning is a typical weakly supervised classificatio...
research
08/30/2021

Noisy Labels for Weakly Supervised Gamma Hadron Classification

Gamma hadron classification, a central machine learning task in gamma ra...
research
10/19/2020

Importance Reweighting for Biquality Learning

The field of Weakly Supervised Learning (WSL) has recently seen a surge ...

Please sign up or login with your details

Forgot password? Click here to reset