Label Denoising through Cross-Model Agreement

08/27/2023
by   Yu Wang, et al.
0

Learning from corrupted labels is very common in real-world machine-learning applications. Memorizing such noisy labels could affect the learning of the model, leading to sub-optimal performances. In this work, we propose a novel framework to learn robust machine-learning models from noisy labels. Through an empirical study, we find that different models make relatively similar predictions on clean examples, while the predictions on noisy examples vary much more across different models. Motivated by this observation, we propose denoising with cross-model agreement (DeCA) which aims to minimize the KL-divergence between the true label distributions parameterized by two machine learning models while maximizing the likelihood of data observation. We employ the proposed DeCA on both the binary label scenario and the multiple label scenario. For the binary label scenario, we select implicit feedback recommendation as the downstream task and conduct experiments with four state-of-the-art recommendation models on four datasets. For the multiple-label scenario, the downstream application is image classification on two benchmark datasets. Experimental results demonstrate that the proposed methods significantly improve the model performance compared with normal training and other denoising methods on both binary and multiple-label scenarios.

READ FULL TEXT

page 5

page 6

page 21

page 22

research
05/20/2021

Probabilistic and Variational Recommendation Denoising

Learning from implicit feedback is one of the most common cases in the a...
research
08/17/2022

CTRL: Clustering Training Losses for Label Error Detection

In supervised machine learning, use of correct labels is extremely impor...
research
03/13/2023

Twin Contrastive Learning with Noisy Labels

Learning from noisy data is a challenging task that significantly degene...
research
05/30/2023

DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling

Learning from noisy labels is a challenge that arises in many real-world...
research
03/07/2017

Learning from Noisy Labels with Distillation

The ability of learning from noisy labels is very useful in many visual ...
research
04/26/2020

Deep k-NN for Noisy Labels

Modern machine learning models are often trained on examples with noisy ...
research
05/31/2023

Label-Retrieval-Augmented Diffusion Models for Learning from Noisy Labels

Learning from noisy labels is an important and long-standing problem in ...

Please sign up or login with your details

Forgot password? Click here to reset