ActiveGLAE: A Benchmark for Deep Active Learning with Transformers

06/16/2023
by   Lukas Rauch, et al.
1

Deep active learning (DAL) seeks to reduce annotation costs by enabling the model to actively query instance annotations from which it expects to learn the most. Despite extensive research, there is currently no standardized evaluation protocol for transformer-based language models in the field of DAL. Diverse experimental settings lead to difficulties in comparing research and deriving recommendations for practitioners. To tackle this challenge, we propose the ActiveGLAE benchmark, a comprehensive collection of data sets and evaluation guidelines for assessing DAL. Our benchmark aims to facilitate and streamline the evaluation process of novel DAL strategies. Additionally, we provide an extensive overview of current practice in DAL with transformer-based language models. We identify three key challenges - data set selection, model training, and DAL settings - that pose difficulties in comparing query strategies. We establish baseline results through an extensive set of experiments as a reference point for evaluating future work. Based on our findings, we provide guidelines for researchers and practitioners.

READ FULL TEXT
research
07/12/2021

Uncertainty-based Query Strategies for Active Learning with Transformers

Active learning is the iterative construction of a classification model ...
research
04/11/2023

OpenAL: Evaluation and Interpretation of Active Learning Strategies

Despite the vast body of literature on Active Learning (AL), there is no...
research
08/01/2023

ALE: A Simulation-Based Active Learning Evaluation Framework for the Parameter-Driven Comparison of Query Strategies for NLP

Supervised machine learning and deep learning require a large amount of ...
research
12/16/2020

Learning active learning at the crossroads? evaluation and discussion

Active learning aims to reduce annotation cost by predicting which sampl...
research
05/23/2023

EASE: An Easily-Customized Annotation System Powered by Efficiency Enhancement Mechanisms

The performance of current supervised AI systems is tightly connected to...
research
12/20/2022

Smooth Sailing: Improving Active Learning for Pre-trained Language Models with Representation Smoothness Analysis

Developed as a solution to a practical need, active learning (AL) method...

Please sign up or login with your details

Forgot password? Click here to reset