Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

01/20/2021
by   Artem Shelmanov, et al.
0

Annotating training data for sequence tagging tasks is usually very time-consuming. Recent advances in transfer learning for natural language processing in conjunction with active learning open the possibility to significantly reduce the necessary annotation budget. We are the first to thoroughly investigate this powerful combination in sequence tagging. We find that taggers based on deep pre-trained models can benefit from Bayesian query strategies with the help of the Monte Carlo (MC) dropout. Results of experiments with various uncertainty estimates and MC dropout variants show that the Bayesian active learning by disagreement query strategy coupled with the MC dropout applied only in the classification layer of a Transformer-based tagger is the best option in terms of quality. This option also has very little computational overhead. We also demonstrate that it is possible to reduce the computational overhead of AL by using a smaller distilled version of a Transformer model for acquiring instances during AL.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2019

Active Learning with Siamese Twins for Sequence Tagging

Deep learning, in general, and natural language processing methods, in p...
research
10/12/2020

Pre-trained Language Model Based Active Learning for Sentence Matching

Active learning is able to significantly reduce the annotation cost for ...
research
06/02/2022

BayesFormer: Transformer with Uncertainty Estimation

Transformer has become ubiquitous due to its dominant performance in var...
research
08/29/2022

Confidence Estimation for Object Detection in Document Images

Deep neural networks are becoming increasingly powerful and large and al...
research
05/07/2022

Towards Computationally Feasible Deep Active Learning

Active learning (AL) is a prominent technique for reducing the annotatio...
research
07/31/2023

A Pre-trained Data Deduplication Model based on Active Learning

In the era of big data, the issue of data quality has become increasingl...
research
10/12/2022

Fast Bayesian Updates for Deep Learning with a Use Case in Active Learning

Retraining deep neural networks when new data arrives is typically compu...

Please sign up or login with your details

Forgot password? Click here to reset