Towards Computationally Feasible Deep Active Learning

05/07/2022
by   Akim Tsvigun, et al.
25

Active learning (AL) is a prominent technique for reducing the annotation effort required for training machine learning models. Deep learning offers a solution for several essential obstacles to deploying AL in practice but introduces many others. One of such problems is the excessive computational resources required to train an acquisition model and estimate its uncertainty on instances in the unlabeled pool. We propose two techniques that tackle this issue for text classification and tagging tasks, offering a substantial reduction of AL iteration duration and the computational overhead introduced by deep acquisition models in AL. We also demonstrate that our algorithm that leverages pseudo-labeling and distilled models overcomes one of the essential obstacles revealed previously in the literature. Namely, it was shown that due to differences between an acquisition model used to select instances during AL and a successor model trained on the labeled data, the benefits of AL can diminish. We show that our algorithm, despite using a smaller and faster acquisition model, is capable of training a more expressive successor model with higher performance.

READ FULL TEXT

page 7

page 14

research
09/10/2021

Active learning for reducing labeling effort in text classification tasks

Labeling data can be an expensive task as it is usually performed manual...
research
07/22/2020

DEAL: Deep Evidential Active Learning for Image Classification

Convolutional Neural Networks (CNNs) have proven to be state-of-the-art ...
research
01/18/2022

Optimizing Active Learning for Low Annotation Budgets

When we can not assume a large amount of annotated data , active learnin...
research
05/16/2023

On Dataset Transferability in Active Learning for Transformers

Active learning (AL) aims to reduce labeling costs by querying the examp...
research
06/10/2022

Weighted Ensembles for Active Learning with Adaptivity

Labeled data can be expensive to acquire in several application domains,...
research
01/20/2021

Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

Annotating training data for sequence tagging tasks is usually very time...
research
10/27/2021

Diversity Enhanced Active Learning with Strictly Proper Scoring Rules

We study acquisition functions for active learning (AL) for text classif...

Please sign up or login with your details

Forgot password? Click here to reset