Uncertainty-based Query Strategies for Active Learning with Transformers

07/12/2021
by Christopher Schröder, et al.

Active learning is the iterative construction of a classification model through targeted labeling, enabling significant labeling cost savings. Because most research on active learning was carried out before transformer-based language models ("transformers") became popular, comparatively few papers to date have investigated how transformers can be combined with active learning, despite the practical importance of this combination. This can be attributed to the fact that using state-of-the-art query strategies with transformers induces a prohibitive runtime overhead, which effectively cancels out, or even outweighs, the aforementioned cost savings. In this paper, we revisit uncertainty-based query strategies, which had previously been largely outperformed but are particularly well suited to fine-tuning transformers. In an extensive evaluation on five widely used text classification benchmarks, we show that considerable improvements of up to 14.4 percentage points in area under the learning curve are achieved, as well as a final accuracy close to the state of the art for all but one benchmark, using only between 0.4
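
As a rough illustration of what an uncertainty-based query strategy does, the sketch below ranks unlabeled examples by prediction entropy and selects the most uncertain ones for labeling. The abstract does not name the specific strategies evaluated, so prediction entropy is assumed here as one common choice; the function names and the probability values are hypothetical, and in practice the probabilities would come from the fine-tuned transformer's softmax output.

```python
import math
from typing import List, Sequence


def prediction_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (natural log) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def query_most_uncertain(
    pool_probs: Sequence[Sequence[float]], batch_size: int
) -> List[int]:
    """Return indices of the `batch_size` pool examples with highest entropy."""
    scores = [prediction_entropy(p) for p in pool_probs]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:batch_size]


# Hypothetical class probabilities for four unlabeled documents (binary task).
pool_probs = [
    [0.98, 0.02],  # confident prediction, low entropy
    [0.55, 0.45],
    [0.70, 0.30],
    [0.50, 0.50],  # maximally uncertain
]

# Indices of the two documents to send to the annotator next.
selected = query_most_uncertain(pool_probs, batch_size=2)
```

Scoring needs only one forward pass per unlabeled example, which is the kind of low overhead that makes such strategies attractive when each model update already requires fine-tuning a transformer.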


research
07/21/2021

Small-text: Active Learning for Text Classification in Python

We present small-text, a simple modular active learning library, which o...
research
06/16/2023

ActiveGLAE: A Benchmark for Deep Active Learning with Transformers

Deep active learning (DAL) seeks to reduce annotation costs by enabling ...
research
09/21/2022

Is More Data Better? Re-thinking the Importance of Efficiency in Abusive Language Detection with Transformers-Based Active Learning

Annotating abusive language is expensive, logistically complex and creat...
research
10/06/2022

To Softmax, or not to Softmax: that is the question when applying Active Learning for Transformer Models

Despite achieving state-of-the-art results in nearly all Natural Languag...
research
02/22/2023

Deep Active Learning in the Presence of Label Noise: A Survey

Deep active learning has emerged as a powerful tool for training deep le...
research
02/06/2022

Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets

Investigating active learning, we focus on the relation between the numb...
research
03/14/2022

Uncertainty Estimation for Language Reward Models

Language models can learn a range of capabilities from unsupervised trai...
