Nuclear Discrepancy for Active Learning

06/08/2017
by   Tom J. Viering, et al.
0

Active learning algorithms propose which unlabeled objects should be queried for their labels to improve a predictive model the most. We study active learners that minimize generalization bounds and uncover relationships between these bounds that lead to an improved approach to active learning. In particular we show the relation between the bound of the state-of-the-art Maximum Mean Discrepancy (MMD) active learner, the bound of the Discrepancy, and a new and looser bound that we refer to as the Nuclear Discrepancy bound. We motivate this bound by a probabilistic argument: we show it considers situations which are more likely to occur. Our experiments indicate that active learning using the tightest Discrepancy bound performs the worst in terms of the squared loss. Overall, our proposed loosest Nuclear Discrepancy generalization bound performs the best. We confirm our probabilistic argument empirically: the other bounds focus on more pessimistic scenarios that are rarer in practice. We conclude that tightness of bounds is not always of main importance and that active learning methods should concentrate on realistic scenarios in order to improve performance.

READ FULL TEXT
research
07/23/2021

MCDAL: Maximum Classifier Discrepancy for Active Learning

Recent state-of-the-art active learning methods have mostly leveraged Ge...
research
03/05/2021

Discrepancy-Based Active Learning for Domain Adaptation

The goal of the paper is to design active learning strategies which lead...
research
09/28/2021

Active Learning for Argument Mining: A Practical Approach

Despite considerable recent progress, the creation of well-balanced and ...
research
02/25/2019

Interpreting Active Learning Methods Through Information Losses

We propose a new way of interpreting active learning methods by analyzin...
research
11/20/2022

Multifidelity Active Learning for Failure Estimation of TRISO Nuclear Fuel

The Tristructural isotropic (TRISO)-coated particle fuel is a robust nuc...
research
06/15/2023

Re-Benchmarking Pool-Based Active Learning for Binary Classification

Active learning is a paradigm that significantly enhances the performanc...
research
04/19/2021

A Mathematical Analysis of Learning Loss for Active Learning in Regression

Active learning continues to remain significant in the industry since it...

Please sign up or login with your details

Forgot password? Click here to reset