Weaker Than You Think: A Critical Look at Weakly Supervised Learning

05/27/2023
by Dawei Zhu, et al.

Weakly supervised learning is a popular approach for training machine learning models in low-resource settings. Instead of requesting high-quality yet costly human annotations, it allows training models with noisy annotations obtained from various weak sources. Recently, many sophisticated approaches have been proposed for robust training under label noise, reporting impressive results. In this paper, we revisit the setup of these approaches and find that the benefits brought by these approaches are significantly overestimated. Specifically, we find that the success of existing weakly supervised learning approaches heavily relies on the availability of clean validation samples which, as we show, can be leveraged much more efficiently by simply training on them. After using these clean labels in training, the advantages of using these sophisticated approaches are mostly wiped out. This remains true even when reducing the size of the available clean data to just five samples per class, making these approaches impractical. To understand the true value of weakly supervised learning, we thoroughly analyse diverse NLP datasets and tasks to ascertain when and why weakly supervised approaches work, and provide recommendations for future research.
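The "five samples per class" setting mentioned above can be made concrete with a small sketch. The code below is only an illustration under stated assumptions, not the authors' implementation: the helper name `sample_few_shot`, the `(text, label)` pair format, and the toy data are all hypothetical. It shows how a small, class-balanced subset of clean validation data could be set aside for direct training.

```python
import random
from collections import defaultdict


def sample_few_shot(examples, k=5, seed=0):
    """Pick up to k clean examples per class from (text, label) pairs.

    Hypothetical helper for illustration; not taken from the paper.
    """
    by_label = defaultdict(list)
    for text, label in examples:
        by_label[label].append((text, label))

    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    subset = []
    for label in sorted(by_label):
        pool = by_label[label]
        # Take k examples, or the whole pool if the class has fewer than k.
        subset.extend(rng.sample(pool, min(k, len(pool))))
    return subset


# Toy data standing in for a clean validation set (assumption: 3 classes).
clean_validation = [(f"doc {i}", i % 3) for i in range(60)]
few_shot_train = sample_few_shot(clean_validation, k=5)
```

The resulting subset would then be used as ordinary training data for a standard classifier, which is the simple baseline the abstract contrasts with more sophisticated noise-robust methods.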

research
02/19/2023

Weakly Supervised Label Learning Flows

Supervised learning usually requires a large amount of labelled data. Ho...
research
04/23/2021

Knodle: Modular Weakly Supervised Learning with PyTorch

Methods for improving the training and prediction quality of weakly supe...
research
08/30/2021

Noisy Labels for Weakly Supervised Gamma Hadron Classification

Gamma hadron classification, a central machine learning task in gamma ra...
research
06/28/2017

(Machine) Learning to Do More with Less

Determining the best method for training a machine learning algorithm is...
research
09/01/2023

Deep-learning-based Early Fixing for Gas-lifted Oil Production Optimization: Supervised and Weakly-supervised Approaches

Maximizing oil production from gas-lifted oil wells entails solving Mixe...
research
10/29/2019

Model enhancement and personalization using weakly supervised learning for multi-modal mobile sensing

Always-on sensing of mobile device user's contextual information is crit...
research
10/06/2021

A New Weakly Supervised Learning Approach for Real-time Iron Ore Feed Load Estimation

Iron ore feed load control is one of the most critical settings in a min...
