Diversity Enhanced Active Learning with Strictly Proper Scoring Rules

10/27/2021
by   Wei Tan, et al.
0

We study acquisition functions for active learning (AL) for text classification. The Expected Loss Reduction (ELR) method focuses on a Bayesian estimate of the reduction in classification error, recently updated with Mean Objective Cost of Uncertainty (MOCU). We convert the ELR framework to estimate the increase in (strictly proper) scores like log probability or negative mean square error, which we call Bayesian Estimate of Mean Proper Scores (BEMPS). We also prove convergence results borrowing techniques used with MOCU. In order to allow better experimentation with the new acquisition functions, we develop a complementary batch AL algorithm, which encourages diversity in the vector of expected changes in scores for unlabelled data. To allow high performance text classifiers, we combine ensembling and dynamic validation set construction on pretrained language models. Extensive experimental evaluation then explores how these different acquisition functions perform. The results show that the use of mean square error and log probability with BEMPS yields robust acquisition functions, which consistently outperform the others tested.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/22/2021

A Simple Baseline for Batch Active Learning with Stochastic Acquisition Functions

In active learning, new labels are commonly acquired in batches. However...
research
01/10/2021

PowerEvaluationBALD: Efficient Evaluation-Oriented Deep (Bayesian) Active Learning with Stochastic Acquisition Functions

We develop BatchEvaluationBALD, a new acquisition function for deep Baye...
research
08/16/2018

Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study

Several recent papers investigate Active Learning (AL) for mitigating th...
research
01/29/2021

Model Adaptation for Image Reconstruction using Generalized Stein's Unbiased Risk Estimator

Deep learning image reconstruction algorithms often suffer from model mi...
research
05/07/2022

Towards Computationally Feasible Deep Active Learning

Active learning (AL) is a prominent technique for reducing the annotatio...
research
09/23/2021

Active Learning for Argument Strength Estimation

High-quality arguments are an essential part of decision-making. Automat...
research
05/03/2018

Experimental Design via Generalized Mean Objective Cost of Uncertainty

The mean objective cost of uncertainty (MOCU) quantifies the performance...

Please sign up or login with your details

Forgot password? Click here to reset