Active Learning with Distributional Estimates

10/16/2012
by   Jens Roeder, et al.
0

Active Learning (AL) is increasingly important in a broad range of applications. Two main AL principles to obtain accurate classification with few labeled data are refinement of the current decision boundary and exploration of poorly sampled regions. In this paper we derive a novel AL scheme that balances these two principles in a natural way. In contrast to many AL strategies, which are based on an estimated class conditional probability ^p(y|x), a key component of our approach is to view this quantity as a random variable, hence explicitly considering the uncertainty in its estimated value. Our main contribution is a novel mathematical framework for uncertainty-based AL, and a corresponding AL scheme, where the uncertainty in ^p(y|x) is modeled by a second-order distribution. On the practical side, we show how to approximate such second-order distributions for kernel density classification. Finally, we find that over a large number of UCI, USPS and Caltech4 datasets, our AL scheme achieves significantly better learning curves than popular AL methods such as uncertainty sampling and error reduction sampling, when all use the same kernel density classifier.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2018

On the Relationship between Data Efficiency and Error for Uncertainty Sampling

While active learning offers potential cost savings, the actual data eff...
research
08/06/2014

When does Active Learning Work?

Active Learning (AL) methods seek to improve classifier performance when...
research
05/23/2023

Active Learning Principles for In-Context Learning with Large Language Models

The remarkable advancements in large language models (LLMs) have signifi...
research
10/13/2016

Semi-Supervised Active Learning for Support Vector Machines: A Novel Approach that Exploits Structure Information in Data

In our today's information society more and more data emerges, e.g. in s...
research
05/14/2020

VirAAL: Virtual Adversarial Active Learning

This paper presents VirAAL, an Active Learning framework based on Advers...
research
06/10/2022

Weighted Ensembles for Active Learning with Adaptivity

Labeled data can be expensive to acquire in several application domains,...
research
12/07/2020

Active Learning Methods for Efficient Hybrid Biophysical Variable Retrieval

Kernel-based machine learning regression algorithms (MLRAs) are potentia...

Please sign up or login with your details

Forgot password? Click here to reset