Towards the Identifiability in Noisy Label Learning: A Multinomial Mixture Approach

01/04/2023
by   Cuong Nguyen, et al.
0

Learning from noisy labels plays an important role in the deep learning era. Despite numerous studies with promising results, identifying clean labels from a noisily-annotated dataset is still challenging since the conventional noisy label learning problem with single noisy label per instance is not identifiable, i.e., it does not theoretically have a unique solution unless one has access to clean labels or introduces additional assumptions. This paper aims to formally investigate such identifiability issue by formulating the noisy label learning problem as a multinomial mixture model, enabling the formulation of the identifiability constraint. In particular, we prove that the noisy label learning problem is identifiable if there are at least 2C - 1 noisy labels per instance provided, with C being the number of classes. In light of such requirement, we propose a method that automatically generates additional noisy labels per training sample by estimating the noisy label distribution based on nearest neighbours. Such additional noisy labels allow us to apply the Expectation - Maximisation algorithm to estimate the posterior of clean labels. We empirically demonstrate that the proposed method is not only capable of estimating clean labels without any heuristics in several challenging label noise benchmarks, including synthetic, web-controlled and real-world label noises, but also of performing competitively with many state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/23/2020

Label Noise Types and Their Effects on Deep Learning

The recent success of deep learning is mostly due to the availability of...
research
06/20/2023

MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels

Despite deep learning has achieved great success, it often relies on a l...
research
03/07/2017

Learning from Noisy Labels with Distillation

The ability of learning from noisy labels is very useful in many visual ...
research
07/31/2023

LaplaceConfidence: a Graph-based Approach for Learning with Noisy Labels

In real-world applications, perfect labels are rarely available, making ...
research
12/21/2022

Class Prototype-based Cleaner for Label Noise Learning

Semi-supervised learning based methods are current SOTA solutions to the...
research
09/12/2018

Hyperspectral Image Classification in the Presence of Noisy Labels

Label information plays an important role in supervised hyperspectral im...
research
06/04/2019

The Extended Dawid-Skene Model: Fusing Information from Multiple Data Schemas

While label fusion from multiple noisy annotations is a well understood ...

Please sign up or login with your details

Forgot password? Click here to reset