Cost-Accuracy Aware Adaptive Labeling for Active Learning

05/24/2021
by   Ruijiang Gao, et al.
0

Conventional active learning algorithms assume a single labeler that produces noiseless label at a given, fixed cost, and aim to achieve the best generalization performance for given classifier under a budget constraint. However, in many real settings, different labelers have different labeling costs and can yield different labeling accuracies. Moreover, a given labeler may exhibit different labeling accuracies for different instances. This setting can be referred to as active learning with diverse labelers with varying costs and accuracies, and it arises in many important real settings. It is therefore beneficial to understand how to effectively trade-off between labeling accuracy for different instances, labeling costs, as well as the informativeness of training instances, so as to achieve the best generalization performance at the lowest labeling cost. In this paper, we propose a new algorithm for selecting instances, labelers (and their corresponding costs and labeling accuracies), that employs generalization bound of learning with label noise to select informative instances and labelers so as to achieve higher generalization accuracy at a lower cost. Our proposed algorithm demonstrates state-of-the-art performance on five UCI and a real crowdsourcing dataset.

READ FULL TEXT
research
03/03/2017

Active Learning for Cost-Sensitive Classification

We design an active learning algorithm for cost-sensitive multiclass cla...
research
06/24/2020

Minimum Cost Active Labeling

Labeling a data set completely is important for groundtruth generation. ...
research
01/24/2020

Active Learning for Entity Alignment

In this work, we propose a novel framework for the labeling of entity al...
research
09/14/2022

Data Lifecycle Management in Evolving Input Distributions for Learning-based Aerospace Applications

As input distributions evolve over a mission lifetime, maintaining perfo...
research
10/27/2020

Active Learning for Noisy Data Streams Using Weak and Strong Labelers

Labeling data correctly is an expensive and challenging task in machine ...
research
09/13/2019

Active learning for level set estimation under cost-dependent input uncertainty

As part of a quality control process in manufacturing it is often necess...
research
01/25/2022

Online Active Learning with Dynamic Marginal Gain Thresholding

The blessing of ubiquitous data also comes with a curse: the communicati...

Please sign up or login with your details

Forgot password? Click here to reset