On the Relationship between Data Efficiency and Error for Uncertainty Sampling

06/15/2018
by Stephen Mussmann, et al.

While active learning offers potential cost savings, the data efficiency observed in practice (the reduction in the amount of labeled data needed to obtain the same error rate) is mixed. This paper poses a basic question: when is active learning actually helpful? We provide an answer for logistic regression with the popular active learning algorithm, uncertainty sampling. Empirically, on 21 datasets from OpenML, we find a strong inverse correlation between data efficiency and the error rate of the final classifier. Theoretically, we show that for a variant of uncertainty sampling, the asymptotic data efficiency is within a constant factor of the inverse error rate of the limiting classifier.
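
To make the two quantities in the abstract concrete, here is a minimal Python sketch. It assumes the textbook form of uncertainty sampling (query the unlabeled point whose predicted probability is closest to 0.5), which may differ from the variant the paper analyzes, and it treats data efficiency as a simple ratio of label counts, which is one reading of "reduction in the amount of labeled data". The function names, weights, pool, and label counts are all hypothetical and chosen only for illustration.

```python
import math


def predict_proba(w, b, x):
    """Logistic regression probability of the positive class for one point."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))


def most_uncertain(w, b, pool):
    """Index of the pool point whose predicted probability is closest to 0.5."""
    return min(
        range(len(pool)),
        key=lambda i: abs(predict_proba(w, b, pool[i]) - 0.5),
    )


def data_efficiency(n_passive, n_active):
    """Labels needed by random sampling divided by labels needed by active
    learning to reach the same error rate (assumed ratio definition)."""
    return n_passive / n_active


# Hypothetical model and unlabeled pool, not taken from the paper.
w, b = [2.0, -1.0], 0.0
pool = [[3.0, 0.0], [0.5, 1.1], [-2.0, 1.0]]

idx = most_uncertain(w, b, pool)          # point nearest the decision boundary
ratio = data_efficiency(1000, 250)        # hypothetical label counts
print(idx, ratio)
```

In a full active learning loop, the selected point would be labeled, added to the training set, and the model refit before the next query; the sketch shows only the selection step.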
