ImitAL: Learned Active Learning Strategy on Synthetic Data

08/24/2022
by   Julius Gonsior, et al.
0

Active Learning (AL) is a well-known standard method for efficiently obtaining annotated data by first labeling the samples that contain the most information based on a query strategy. In the past, a large variety of such query strategies has been proposed, with each generation of new strategies increasing the runtime and adding more complexity. However, to the best of our our knowledge, none of these strategies excels consistently over a large number of datasets from different application domains. Basically, most of the the existing AL strategies are a combination of the two simple heuristics informativeness and representativeness, and the big differences lie in the combination of the often conflicting heuristics. Within this paper, we propose ImitAL, a domain-independent novel query strategy, which encodes AL as a learning-to-rank problem and learns an optimal combination between both heuristics. We train ImitAL on large-scale simulated AL runs on purely synthetic datasets. To show that ImitAL was successfully trained, we perform an extensive evaluation comparing our strategy on 13 different datasets, from a wide range of domains, with 7 other query strategies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2021

ImitAL: Learning Active Learning Strategies from Synthetic Data

One of the biggest challenges that complicates applied supervised machin...
research
12/18/2020

Rebuilding Trust in Active Learning with Actionable Metrics

Active Learning (AL) is an active domain of research, but is seldom used...
research
06/25/2021

Multi-Domain Active Learning: A Comparative Study

Building classifiers on multiple domains is a practical problem in the r...
research
11/02/2020

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

Active learning (AL) uses a data selection algorithm to select useful tr...
research
06/06/2023

How to Select Which Active Learning Strategy is Best Suited for Your Specific Problem and Budget

In Active Learning (AL), a learner actively chooses which unlabeled exam...
research
10/09/2018

Discovering General-Purpose Active Learning Strategies

We propose a general-purpose approach to discovering active learning (AL...
research
06/19/2023

Perturbation-Based Two-Stage Multi-Domain Active Learning

In multi-domain learning (MDL) scenarios, high labeling effort is requir...

Please sign up or login with your details

Forgot password? Click here to reset