Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study

08/16/2018
by   Aditya Siddhant, et al.
10

Several recent papers investigate Active Learning (AL) for mitigating the data dependence of deep learning for natural language processing. However, the applicability of AL to real-world problems remains an open question. While in supervised learning, practitioners can try many different methods, evaluating each against a validation set before selecting a model, AL affords no such luxury. Over the course of one AL run, an agent annotates its dataset exhausting its labeling budget. Thus, given a new task, an active learner has no opportunity to compare models and acquisition functions. This paper provides a large scale empirical study of deep active learning, addressing multiple tasks and, for each, multiple datasets, multiple models, and a full suite of acquisition functions. We find that across all settings, Bayesian active learning by disagreement, using uncertainty estimates provided either by Dropout or Bayes-by Backprop significantly improves over i.i.d. baselines and usually outperforms classic uncertainty sampling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/27/2020

Deep Active Learning for Sequence Labeling Based on Diversity and Uncertainty in Gradient

Recently, several studies have investigated active learning (AL) for nat...
research
02/01/2022

Active Learning Over Multiple Domains in Natural Language Tasks

Studies of active learning traditionally assume the target and source da...
research
06/26/2018

Dropout-based Active Learning for Regression

Active learning is relevant and challenging for high-dimensional regress...
research
06/02/2023

Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

The costly human effort required to prepare the training data of machine...
research
10/26/2020

Inspecting Sample Reusability for Active Learning

Active Learning (AL) exploits a learning algorithm to selectively sample...
research
06/02/2022

BayesFormer: Transformer with Uncertainty Estimation

Transformer has become ubiquitous due to its dominant performance in var...
research
10/27/2021

Diversity Enhanced Active Learning with Strictly Proper Scoring Rules

We study acquisition functions for active learning (AL) for text classif...

Please sign up or login with your details

Forgot password? Click here to reset