A General Framework for Learning under Corruption: Label Noise, Attribute Noise, and Beyond

07/17/2023
by   Laura Iacovissi, et al.
0

Corruption is frequently observed in collected data and has been extensively studied in machine learning under different corruption models. Despite this, there remains a limited understanding of how these models relate such that a unified view of corruptions and their consequences on learning is still lacking. In this work, we formally analyze corruption models at the distribution level through a general, exhaustive framework based on Markov kernels. We highlight the existence of intricate joint and dependent corruptions on both labels and attributes, which are rarely touched by existing research. Further, we show how these corruptions affect standard supervised learning by analyzing the resulting changes in Bayes Risk. Our findings offer qualitative insights into the consequences of "more complex" corruptions on the learning problem, and provide a foundation for future quantitative comparisons. Applications of the framework include corruption-corrected learning, a subcase of which we study in this paper by theoretically analyzing loss correction with respect to different corruption instances.

READ FULL TEXT

page 8

page 9

page 10

page 12

research
03/04/2022

Learning from Label Proportions by Learning with Label Noise

Learning from label proportions (LLP) is a weakly supervised classificat...
research
03/13/2021

Learning with Feature-Dependent Label Noise: A Progressive Approach

Label noise is frequently observed in real-world large-scale datasets. T...
research
11/09/2020

A Survey of Label-noise Representation Learning: Past, Present and Future

Classical machine learning implicitly assumes that labels of the trainin...
research
03/02/2020

Structured Prediction with Partial Labelling through the Infimum Loss

Annotating datasets is one of the main costs in nowadays supervised lear...
research
01/28/2019

A Framework for Understanding Unintended Consequences of Machine Learning

As machine learning increasingly affects people and society, it is impor...
research
09/23/2020

Using Under-trained Deep Ensembles to Learn Under Extreme Label Noise

Improper or erroneous labelling can pose a hindrance to reliable general...
research
04/12/2021

Understanding Prediction Discrepancies in Machine Learning Classifiers

A multitude of classifiers can be trained on the same data to achieve si...

Please sign up or login with your details

Forgot password? Click here to reset