Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

05/28/2021
by Glenn Dawson, et al.

Most studies on learning from noisy labels rely on unrealistic models of i.i.d. label noise, such as class-conditional transition matrices. More recent work on instance-dependent noise models is more realistic, but it assumes a single generative process for label noise across the entire dataset. We propose a more principled model of label noise that generalizes instance-dependent noise to multiple labelers, based on the observation that modern datasets are typically annotated using distributed crowdsourcing methods. Under our labeler-dependent model, label noise manifests in two modalities: the natural error of good-faith labelers, and adversarial labels provided by malicious actors. We present two adversarial attack vectors that more accurately reflect the label noise that may be encountered in real-world settings, and demonstrate that, under our multimodal noisy labels model, state-of-the-art approaches for learning from noisy labels are defeated by adversarial label attacks. Finally, we propose a multi-stage, labeler-aware, model-agnostic framework that reliably filters noisy labels by leveraging knowledge of which data partitions were labeled by which labeler, and show that the proposed framework remains robust even in the presence of extreme adversarial label noise.
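
To make the labeler-dependent setting concrete, the following is a minimal simulation sketch and not the authors' model or framework. It assumes a round-robin partition of samples across labelers, a fixed per-labeler error rate for good-faith labelers, and a simple adversary that always returns the next class index; the function names `simulate_labeler_noise` and `per_labeler_noise_rate` are hypothetical, and the abstract does not specify the paper's actual attack vectors or filtering stages.

```python
import random


def simulate_labeler_noise(true_labels, num_classes, labeler_error_rates,
                           adversarial, seed=0):
    """Corrupt labels per labeler (illustrative assumptions, not the paper's model).

    labeler_error_rates[j]: probability that good-faith labeler j mislabels a sample.
    adversarial: set of labeler indices that always return a wrong label.
    Returns (noisy_labels, owner), where owner[i] is the labeler of sample i.
    """
    rng = random.Random(seed)
    num_labelers = len(labeler_error_rates)
    noisy_labels, owner = [], []
    for i, y in enumerate(true_labels):
        j = i % num_labelers  # assumed round-robin partition of the data
        if j in adversarial:
            label = (y + 1) % num_classes  # assumed attack: always a wrong class
        elif rng.random() < labeler_error_rates[j]:
            # Good-faith error: pick a wrong class uniformly at random.
            label = rng.choice([c for c in range(num_classes) if c != y])
        else:
            label = y
        noisy_labels.append(label)
        owner.append(j)
    return noisy_labels, owner


def per_labeler_noise_rate(true_labels, noisy_labels, owner, num_labelers):
    """Fraction of wrong labels in each labeler's partition.

    Needs ground truth, so it is only usable in simulation.
    """
    wrong = [0] * num_labelers
    total = [0] * num_labelers
    for y, z, j in zip(true_labels, noisy_labels, owner):
        total[j] += 1
        wrong[j] += int(y != z)
    return [w / t if t else 0.0 for w, t in zip(wrong, total)]


true_labels = [i % 4 for i in range(1200)]
noisy_labels, owner = simulate_labeler_noise(
    true_labels, num_classes=4,
    labeler_error_rates=[0.0, 0.1, 0.0], adversarial={2})
rates = per_labeler_noise_rate(true_labels, noisy_labels, owner, num_labelers=3)
```

Because the noise rate is tracked per partition, the adversarial labeler stands out sharply from the good-faith ones, which is the kind of signal a labeler-aware filter could exploit. A single dataset-wide noise rate would average this difference away.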


