ActiveLab: Active Learning with Re-Labeling by Multiple Annotators

01/27/2023
by   Hui Wen Goh, et al.
0

In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming to train an accurate classifier by collecting a dataset with the fewest total annotations. Here we propose ActiveLab, a practical method to decide what to label next that works with any classifier model and can be used in pool-based batch active learning with one or multiple annotators. ActiveLab automatically estimates when it is more informative to re-label examples vs. labeling entirely new ones. This is a key aspect of producing high quality labels and trained models within a limited annotation budget. In experiments on image and tabular data, ActiveLab reliably trains more accurate classifiers with far fewer annotations than a wide variety of popular active learning methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2021

Class-Balanced Active Learning for Image Classification

Active learning aims to reduce the labeling effort that is required to t...
research
10/15/2021

Learning with Noisy Labels by Targeted Relabeling

Crowdsourcing platforms are often used to collect datasets for training ...
research
10/13/2022

Utilizing supervised models to infer consensus labels and their quality from data with multiple annotators

Real-world data for classification is often labeled by multiple annotato...
research
08/05/2022

A Holistic Approach to Undesired Content Detection in the Real World

We present a holistic approach to building a robust and useful natural l...
research
06/30/2023

Ticket-BERT: Labeling Incident Management Tickets with Language Models

An essential aspect of prioritizing incident tickets for resolution is e...
research
07/25/2023

Robust Assignment of Labels for Active Learning with Sparse and Noisy Annotations

Supervised classification algorithms are used to solve a growing number ...
research
02/27/2017

Active Learning Using Uncertainty Information

Many active learning methods belong to the retraining-based approaches, ...

Please sign up or login with your details

Forgot password? Click here to reset