ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision

04/14/2022
by   Anastasiia Sedova, et al.
0

A way to overcome expensive and time-consuming manual data labeling is weak supervision - automatic annotation of data samples via a predefined set of labeling functions (LFs), rule-based mechanisms that generate potentially erroneous labels. In this work, we investigate noise reduction techniques for weak supervision based on the principle of k-fold cross-validation. In particular, we extend two frameworks for detecting the erroneous samples in manually annotated data to the weakly supervised setting. Our methods profit from leveraging the information about matching LFs and detect noisy samples more accurately. We also introduce a new algorithm for denoising the weakly annotated data called ULF, that refines the allocation of LFs to classes by estimating the reliable LFs-to-classes joint matrix. Evaluation on several datasets shows that ULF successfully improves weakly supervised learning without using any manually labeled data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2023

Generalized Weak Supervision for Neural Information Retrieval

Neural ranking models (NRMs) have demonstrated effective performance in ...
research
04/28/2022

WeaNF: Weak Supervision with Normalizing Flows

A popular approach to decrease the need for costly manual annotation of ...
research
06/21/2021

Demonstration of Panda: A Weakly Supervised Entity Matching System

Entity matching (EM) refers to the problem of identifying tuple pairs in...
research
06/03/2022

XPASC: Measuring Generalization in Weak Supervision by Explainability and Association

Weak supervision is leveraged in a wide range of domains and tasks due t...
research
05/26/2020

Learning with Weak Supervision for Email Intent Detection

Email remains one of the most frequently used means of online communicat...
research
09/23/2022

From Weakly Supervised Learning to Active Learning

Applied mathematics and machine computations have raised a lot of hope s...
research
06/18/2022

Weakly Supervised Classification of Vital Sign Alerts as Real or Artifact

A significant proportion of clinical physiologic monitoring alarms are f...

Please sign up or login with your details

Forgot password? Click here to reset