Inspecting Sample Reusability for Active Learning

10/26/2020
by   Katharina Morik, et al.
0

Active Learning (AL) exploits a learning algorithm to selectively sample examples which are expected to be highly useful for model learning. The resulting sample is governed by a sampling selection bias. While a bias towards useful examples is desirable, there is also a bias towards the learner applied during AL selection. This paper addresses sample reusability, i.e., the question whether and under which conditions samples selected by AL using one learning algorithm are well-suited as training data for another learning algorithm. Our empirical investigation on general classification problems as well as the natural language processing subtask of Named Entity Recognition shows that many intuitive assumptions on reusability characteristics do not hold. For example, using the same algorithm during AL selection (called selector) and for inducing the final model (called consumer) is not always the optimal choice. We investigate several putatively explanatory factors for sample reusability. One finding is that the suitability of certain selector-consumer pairings cannot be estimated independently of the actual learning problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/08/2020

LTP: A New Active Learning Strategy for Bert-CRF Based Named Entity Recognition

In recent years, deep learning has achieved great success in many natura...
research
06/18/2020

On the Robustness of Active Learning

Active Learning is concerned with the question of how to identify the mo...
research
08/16/2018

Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study

Several recent papers investigate Active Learning (AL) for mitigating th...
research
06/12/2018

Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning

Active learning (AL) aims to enable training high performance classifier...
research
07/30/2014

Targeting Optimal Active Learning via Example Quality

In many classification problems unlabelled data is abundant and a subset...
research
02/05/2015

Estimating Optimal Active Learning via Model Retraining Improvement

A central question for active learning (AL) is: "what is the optimal sel...
research
05/30/2019

Understanding Goal-Oriented Active Learning via Influence Functions

Active learning (AL) concerns itself with learning a model from as few l...

Please sign up or login with your details

Forgot password? Click here to reset