Beyond Active Learning: Leveraging the Full Potential of Human Interaction via Auto-Labeling, Human Correction, and Human Verification

06/02/2023
by   Nathan Beck, et al.
0

Active Learning (AL) is a human-in-the-loop framework to interactively and adaptively label data instances, thereby enabling significant gains in model performance compared to random sampling. AL approaches function by selecting the hardest instances to label, often relying on notions of diversity and uncertainty. However, we believe that these current paradigms of AL do not leverage the full potential of human interaction granted by automated label suggestions. Indeed, we show that for many classification tasks and datasets, most people verifying if an automatically suggested label is correct take 3× to 4× less time than they do changing an incorrect suggestion to the correct label (or labeling from scratch without any suggestion). Utilizing this result, we propose CLARIFIER (aCtive LeARnIng From tIEred haRdness), an Interactive Learning framework that admits more effective use of human interaction by leveraging the reduced cost of verification. By targeting the hard (uncertain) instances with existing AL methods, the intermediate instances with a novel label suggestion scheme using submodular mutual information functions on a per-class basis, and the easy (confident) instances with highest-confidence auto-labeling, CLARIFIER can improve over the performance of existing AL approaches on multiple datasets – particularly on those that have a large number of classes – by almost 1.5× to 2× in terms of relative labeling cost.

READ FULL TEXT

page 2

page 4

page 12

page 13

research
06/17/2022

Active Data Discovery: Mining Unknown Data using Submodular Information Measures

Active Learning is a very common yet powerful framework for iteratively ...
research
01/30/2020

Fase-AL – Adaptation of Fast Adaptive Stacking of Ensembles for Supporting Active Learning

Classification algorithms to mine data stream have been extensively stud...
research
04/26/2021

Unsupervised Instance Selection with Low-Label, Supervised Learning for Outlier Detection

The laborious process of labeling data often bottlenecks projects that a...
research
01/09/2023

Active Learning for Abstractive Text Summarization

Construction of human-curated annotated datasets for abstractive text su...
research
06/09/2020

Cost-effective Interactive Attention Learning with Neural Attention Processes

We propose a novel interactive learning framework which we refer to as I...
research
01/24/2020

Active Learning for Entity Alignment

In this work, we propose a novel framework for the labeling of entity al...
research
10/03/2021

Annotation Cost Reduction of Stream-based Active Learning by Automated Weak Labeling using a Robot Arm

Stream-based active learning (AL) is an efficient training data collecti...

Please sign up or login with your details

Forgot password? Click here to reset